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5 TITLE OF THE INVENTION 
DIABETES GENE 



BACKGROUND OF THE INVENTION 
10 The present invention relates generally to the field of human genetics. Specifically, the 

present invention relates to methods and materials used to isolate and detect human diabetes 
mellitus predisposing gene, specifically the angiotensinogen (AGT) gene, some mutant alleles of 
which cause susceptibility to insulin-dependent diabetes melitus (IDDM). More specifically, the 
invention relates to germline mutations in the AGT gene and their use in the diagnosis of 
15 predisposition to IDDM. The invention also relates to the prophylaxis and/or therapy of diabetes 
associated with a mutation in the AGT gene. The invention further relates to the screening of 
drugs for diabetes therapy. Finally, the invention relates to the screening of the AGT gene for 
mutations, which are useful for diagnosing the predisposition to diabetes. 

The publications and other materials used herein to illuminate the background of the 
20 invention, and in particular, cases to provide additional details respecting the practice, are 
incorporated herein by reference, and for convenience, are referenced by author and date in the 
following text and respectively grouped in the appended List of References. 

Diabetes mellitus is among the most common of all metabolic disorders, affecting up to 
11% of the population by age 70. Type I diabetes (insulin dependent diabetes mellitus or 
25 IDDM) represents about 5 to 1 0% of this group and is the result of a progressive autoimmune 
destruction of the pancreatic P-cells with subsequent insulin deficiency. IDDM is characterized 
by a partial or complete inability to produce insulin. Patients with IDDM would die without 
daily insulin injections to control their disease. 

Few advancements in resolving the pathogenesis of diabetes were made until the mid- 
30 1970s when evidence began to accumulate to suggest that IDDM had an autoimmune 
etiopathogenesis. It is now generally accepted that IDDM results from the chronic autoimmune 
destruction of the insulin producing pancreatic P-cells. Lymphocytes and other inflammatory 
cells have been observed within the islets of Langerhans in newly diagnosed IDDM patients and 
have been found preferentially in regenerating islets composed of P-cells rather than those of 
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other cell types. This active immunological process is associated with a variety of autoantibodies 
to p-cells cytoplasmic and membrane antigens, insulin, and insulin receptors. Although the 
general mechanism by which IDDM occurs is known, IDDM becomes clinically evident only 
after the vast majority of pancreatic P-cells have been irrevocably destroyed and the individual 
5 becomes dependent upon exogenous insulin. 

In both humans and diabetes-prone non-obese diabetic (NOD) mice, genes mapping to 
the major histocompatibility complex have been associated with susceptibility to IDDM and 
shown to be very important in the disease process (Todd, 1990). Studies of NOD mice have 
mapped at least 12 other susceptibility genes to specific chromosomal locations (Prochazaka et 

10 al. f 1987; Todd et al. 5 1991; De Gouyon et al., 1993; Morahan et al., 1994; Serreze et al., 1994; 
Cornall et al., 1991; Garchon et al., 1991). In humans, markers near the insulin/insulin-like 
growth factor loci have also been associated with IDDM (Bell et al., 1984). 

Epidemiological and molecular genetic studies indicate that only 40% of the genetic 
susceptibility is due to alleles of the MHC class II genes, suggesting additional non-MHC genes 

15 are involved (Todd et al., 1988; Buzzetti. et al., 1998). Genome-wide searches and analyses of 
specific genes have identified more at least 17 loci that contribute to the disease (Buzzetti et al., 
1998; Todd, 1995; Concannon et al., 1998; Hashimoto et al., 1994; Davies et al., 1994; Mein et 
al., 1998; Verge et al., 1998). Significant evidence for linkage was reported for a ~7cM region 
on chromosome lq42-43 (Concannon et al., 1998). This region of chromosome 1 contains the 

20 angiotensinogen {ACT) gene. 

Non-insulin dependent diabetes mellitus (NIDDM) is one of the most common inherited 
diseases in man with an estimated prevalence in Caucasian populations of 8-10% (Harris et al., 
1987). It is estimated to affect more than 100 million people worldwide (King and Zimmer, 
1988; Harris et al., 1992). Clinically, NIDDM is a heterogenous disorder characterized by 

25 chronic hyperglycemia leading to progressive micro- and macrovascular lesions in the 
cardiovascular, renal and visual systems as well as diabetic neuropathy. The causes of the fasting 
hyperglycemia and/or glucose intolerance associated with this form of diabetes are not well 
understood. Unfortunately, the disease is associated with early morbidity and mortality. 
Subtypes of the disease can be identified based at least to some degree on the time of onset of 

30 the symptoms. The principal type of NIDDM occurs at a later time of onset, typically at midlife. 
Early-onset NIDDM or maturity-onset diabetes of the young (MODY) shares many features 
with the more common form(s) of NIDDM but onset occurs in early life. 
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Although most forms of NIDDM do not exhibit simple Mendelian inheritance, the 
contribution of heredity is well recognized (Rotter et ah, 1990). The genetic basis of a few rare 
monogenic syndromes of NIDDM have been elucidated, but together, these syndromes account 
for a very small minority of cases (Taylor et al., 1992; Froguel et al., 1993; Steiner et al., 1990; 
5 Kadowaki et aL, 1994). It is likely that the common forms of NIDDM are complex and 
heterogenous, and result when a pool of mutant genes, each of which contributes modestly and 
in a subtle way, interact with each other and with environmental, aging and behavioral 
influences to lead to the expression of the disease. This pool of genes may vary between 
populations and among individuals within a population, despite the illusion of a clinically 

1 0 homogenous phenotype. 

Certain loci have been linked to rare early-onset forms of Type II diabetes that is 
associated with chronic hyperglycemia and monogenic inheritance (i.e. maturity onset diabetes 
of the young (MODY) loci) (Bell et al., 1991; Froguel et al., 1992; Hattersley et al., 1992; 
Vaxillaire et aL, 1995). The defects in the glucokinase (GCK) gene on human chromosome 7 

15 have been found to be responsible for the relatively rare MODY2 phenotype. (Froguel et al., 
1992). 

The genes responsible for MODY1 and MODY3 have been identified to be transcription 
factors hepatocyte nuclear factor 4 (HNF4) and hepatocyte nuclear factor 1 HNF1, repsectively. 
(Yamagata et al., 1996a; Yamagata et al. 1996b). The diabetes gene HNF4/MODY1 regulates 

20 expression of the AGT gene (Yanai et al., 1999). Mutations in HNF4/MODY1 have been 
demonstrated to cause predisposition to maturity onset diabetes of the young.(MODY) a 
genetically heterogenouse monogenic form of non-insulin-dependent diabetes mellitus 
(NIDDM) (Yamagata et al., 1996b; Lindner et al., 1997). 

Another locus has been identified for a rare early-onset form with mitochondrial 

25 inheritance (Van den Ouwenland et al., 1992). In addition, Harris et al. (1996) identified a locus 
of NIDDM 1 on chromosome 2 that appears to play a role in Mexican American diabetes. 
Further, Mahtani et al. (1996) report evidence of the existence of a gene on human chromosome 
12, NIDDM2, that causes NIDDM associated with low insulin secretion. The paper suggests that 
NIDDM2 and MODY3 represent different alleles of the same gene with severe mutations 

30 causing MODY3 and milder mutations giving rise to later-onset NIDDM characterized by low 
insulin secretion. 
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In view of the heterogeneity of genes associated with IDDM or NIDDM, it is desired to . 
identify additional genes associated with IDDM or NIDDM for diagnostic and therapeutic 
purposes. 

SUMMARY OF THE INVENTION 

Tight linkage between DNA polymorphisms in and near the angiotensinogen (AGT) 
gene on chromosome 1 and diabetes in particular families was observed (Concannon, 1999). 
This invention provides the first evidence implicating specific mutations in the AGT gene with 
susceptibility to IDDM. 

Thus, in one aspect of the invention, novel nucleic acids containing additional genomic 
sequences are provided. These nucleic acids are set forth in SEQ ID NOs:l-5 and 8. 

In a second aspect of the invention, a method for detecting a susceptibility in an 
individual to insulin-dependent diabetes mellitus is provided. Thus, the present invention 
provides methods for determining whether a subject is at risk for developing diabetes due to a 
mutation in the AGT gene. This method relies on the fact that mutations in the AGT have been 
correlated by the inventors with the disease. It will be understood by those of skill in the art, 
given the disclosure of the invention that such mutations are associated with a susceptability to 
diabetes, that a variety of methods may be utilized to detect mutations in the AGT gene, 
including the mutations disclosed herein, which are associated with a susceptability to IDDM. 

The method can include detecting, in a tissue of the subject, the presence or absence of a 
polymorphism of the AGT gene. The detection of a polymorphism in the AGT gene may 
include ascertaining the existence of at least one of: a deletion of one or more nucleotides; an 
addition of one or more nucleotides, a substitution of one or more nucleotides; a gross 
chromosomal rearrangement; an alteration in the level of a messenger RNA transcript; the 
presence of a non-wild type splicing pattern of a messenger RjNA transcript; a non-wild type 
level of an AGT protein; and/or an aberrant level of an AGT protein. 

For example, detecting the polymorphism can include (i) providing a probe/primer 
comprised of an oligonucleotide which hybridizes to a sense or antisense sequence of an AGT 
gene or naturally occurring mutants thereof, or 5' or 3 f flanking sequences naturally associated 
with an AGT gene; (ii) contacting the probe/primer to an appropriate nucleic acid containing 
sample; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the presence 
or absence of the polymorphism; e.g. wherein detecting the polymorphism comprises utilizing 
the probe/primer to determine the nucleotide sequence of an AGT gene and, optionally, of the 
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flanking nucleic acid sequences. For instance, the primer can be employed in a polymerase 
chain reaction (PCR), in a ligase chain reaction (LCR) or other amplification reactions known to 
a skilled artisan. In alternate embodiments, the level of an AGT protein is detected in an 
immunoassay using an antibody which is specifically immunoreactive with the AGT protein. 
5 In a third aspect of the invention, compounds that are agonists or antagonists of a normal 

(functional) AGT bioactivity and their use in preventing or treating diabetes are provided. For 
example, to ameliorate disease symptoms involving insufficient expression of an AGT gene 
and/or inadequate amount of functional AGT bioactivity in a subject, a gene therapeutic 
(comprising a gene encoding a functional AGT protein) or a protein therapeutic (comprising a 

10 functional AGT protein or fragment thereof) can be administered to the subject. Alternatively, 
agonists or antagonists of AGT function (wild-type or mutant) or an AGT receptor or a receptor 
for fragments of AGT can be administered. 

In a fourth aspect of the invention, compounds that are antagonists of a disease causing 
AGT bioactivity and their use in preventing or treating diabetes are provided. For example, to 

15 ameliorate disease symptoms involving expression of a mutant AGT gene or aberrant expression 
of a normal AGT gene in a subject, a therapeutically effective amount of an antisense, ribozyme 
or triple helix molecule to reduce or prevent gene expression may be administered to the subject 
Alternatively, to ameliorate disease symptoms involving the regulation via an AGT protein or 
AGT protein fragments of an upstream or downstream element in an AGT mediated biochemical 

20 pathway (e.g. signal transduction), a therapeutically effective amount of an agonist or antagonist 
compound (e.g. small molecule, peptide, peptidomimetic, protein or antibody) that can prevent 
normal binding of the wildtype AGT protein, can induce a therapeutic effect. 

In another aspect of the invention, assays, e.g., for screening test compounds to identify 
antagonists (e.g. inhibitors), or alternatively, agonists (e.g. potentiators), of an interaction 

25 between an AGT protein and, for example, a protein or nucleic acid that binds to the AGT 
protein or fragments of AGT are provided. An exemplary method includes the steps of (i) 
combining an AGT polypeptide or bioactive fragments thereof, an AGT target molecule (such as 
an AGT ligand or nucleic acid), and a test compound, e.g., under conditions wherein, but for the 
test compound, the AGT protein and AGT target molecule are able to interact; and (ii) detecting 

30 the formation of a complex which includes the AGT protein and the target molecule either by 
directly quantitating the complex or by measuring inductive effects of the AGT protein or 
fragments of AGT protein. A statistically significant change, such as a decrease, in the 
interaction of the AGT and AGT target molecule in the presence of a test compound (relative to 



6 

what is detected in the absence of the test compound) is indicative of a modulation (e.g., 
inhibition or potentiation of the interaction between the AGT protein or fragments of the AGT 
protein and the target molecule). 

In a further aspect of the present invention concerns methods for modulating the 
5 transcription of certain genes in a cell by modulating AGT bioactivity, (e.g., by potentiating or 
disrupting an AGT bioactivity). In general, whether carried out in vivo, in vitro, or in situ, the 
method comprises treating the cell with an effective amount of an AGT therapeutic (agonist or 
antagonist of an AGT bioactivity) so as to alter, relative to the cell in the absence of treatment, 
the level of transcription of certain genes. Accordingly, the method can be carried out with AGT 

10 therapeutics such as peptide and peptidomimetics or other molecules identified in the above- 
referenced drug screens which agonize or antagonize the effects of a AGT bioactivity (e.g. 
transcription) of a gene which is regulated by an AGT protein. Other AGT therapeutics include 
antisense constructs for inhibiting expression of AGT proteins, and dominant negative mutants 
of AGT proteins which competitively inhibit interactions between ligands (e.g. proteins) and 

15 nucleic acids upstream and downstream of the wild-type AGT protein. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1C show a physical map of the AGT region. Fig. 1A: Microsatellite markers 
used in genotyping. The STS WI 18298 used to screen the BAC library lies on BAC 1-30. Fig. 

20 IB: BACs containing the AGT gene. Fig. 1C: structure of AGT and alignment on BAC contig. 

Figures 2A-2E show haplotypes and segregation of AGT mutations in multiplex type 1 
diabetes families. Shared haplotypes are in shaded boxes. Fig. 2A: Cysl8Arg. Fig. 2B: C(96)T 
in alternative exon 1. Fig. 2C: T(286)C in alternative exon 1. Fig. 2D: Pro264Ser linked to a 
silent polymorphism is present on the haplotype with the asterisk in family 30. Fig. 2E: C(- 

25 81 1)T is on the same haplotype in family 30 as Pro264Ser. 

Figures 3A-3C show fare polymorphisms in AGT and expression of alternatively spliced 
transcripts. Fig. 1 A: Genomic organization of ^GT^and position of rare polymorphisms found in 
diabetes mellitus families. Approximate positions of promoters for the alternative and published 
transcripts are indicated by large arrows. The positions and types of rare polymorphisms 

30 identified in multiplex diabetes families are indicated above the exons. Within the coding exons 
the numbers refer to the base pair position in the cDNA. The polymorphism at -81 1 is relative 
to the published transcription start site in exon I (Gaillard et al., 1989). A schematic of the AGT 
polypeptide is shown below. The positions of the amino acid substitutions caused by the 



WO 00/71751 



PCT/US00/13327 



7 

polymorphisms are indicated relative to the start of the mature peptide. Coding region of AGT 
(black box); signal sequence(ss) (gray box); Ang I precursor peptide to the octapeptide Ang II 
(black box); mature peptide (black box and hatched box). Greater than lOOkb of genomic DNA 
(dashed line). All Cys positions of AGT indicated in a circle. Fig. 3B: Alignment of 
5 angiotensinogen from human, gorilla, chimpanzee, rat, sheep, and mouse is shown. Cys 18 in 
bold; Asnl37-Cysl38-Thrl39 underlined. Fig. 3C: Alignment of additional missense 
mutations: Alal22, Pro264, Asp319, and Ala401 shown in bold. 

DETAILED DESCRIPTION OF THE INVENTION 
10 The present invention is based in part on the discovery of polymorphisms in the AGT 

gene which are linked to insulin-dependent diabetes mellitus. Based on this finding the 
invention provides therapeutic methods, compositions and diagnostics for diabetes based on 
AGT. 

The cDNA for the AGT gene and the protein sequence are set forth in GenBank 

15 accession numbers K02215 and AH002594. The latter accession shows some genomic 
sequences (about 40-60 nucleotides) surrounding the 5 exons of the AGT gene. The AGT gene 
comprises 5 exons which are shown in SEQ ID NOS:l-5, along with genomic intron or 
untranslated sequences. SEQ ID NOs: 1-5 contain additional genomic sequences not previously 
disclosed. The AGT gene encodes a protein which comprises a 33 amino acid signal peptide 

20 and a 452 amino acid mature peptide. The coding sequence for AGT compiled from the exons 
in SEQ ID NOs: 1-5 is shown in SEQ ID NO:6. The corresponding amino acid sequence for 
AGT is set forth in SEQ ID NO:7. The sequence of alternate exon 1 along with surrounding 
genomic intron or untranslated sequences is shown in SEQ ID NO: 8. 

The present invention relates to AGT agonists and antagonists and their use in treating 

25 diabetes. For example, (i) nucleic acid molecules encoding functional AGT protein; (ii) nucleic 
acids that are effective antisense, ribozyme and triplex antagonists of nucleic acids encoding 
functional AGT protein; (iii) functional AGT proteins or peptides; (iv) anti-AGT antibodies; (v) 
drugs affecting wild-type or mutant AGT function or AGT interaction with an AGT receptor and 
preparations of such compositions are disclosed herein. In addition, the invention provides drug 

30 discovery assays for identifying additional agents that agonize or antagonize the biological 
function of AGT protein (e.g. by altering the interaction of AGT molecules with either 
downstream or upstream elements in the biochemical (e.g. signal transduction) pathway). 
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Moreover, the present invention provides assays for diagnosing whether a subject has or has a 
predisposition towards developing diabetes mellitus. 

Proof that any particular gene located within the genetically defined interval is a disease 
susceptability locus is obtained by finding sequences in DNA or RNA extracted from affected 
5 kindred members which create abnormal gene products or abnormal levels of gene product. 
Such disease susceptibility alleles will co-segregate with the disease in large kindreds. They 
will also be present at a much higher frequency in non-kindred individuals with the disease than 
in individuals in the general population. In identifying a disease susceptability locus, the key is 
to find polymorphisms or mutations which are serious enough to cause obvious disruption to the 

10 normal function of the gene product. These mutations can take a number of forms. The most 
severe forms would be frame shift mutations or large deletions which would cause the gene to 
code for an abnormal protein or one which would significantly alter protein expression. Less 
severe disruptive mutations would include small in-frame deletions and nonconservative base 
pair substitutions which would have a significant effect on the protein produced, such as changes 

15 to or from a cysteine residue, from a basic to an acidic amino acid or vice versa, from a 
hydrophobic to hydrophilic amino acid or vice versa, or other mutations which would affect 
secondary, tertiary or quaternary protein structure. Small deletions or base pair substitutions 
could also significantly alter protein expression by changing the level of transcription, splice 
pattern, mRNA stability, or translation efficiency of the gene transcript. Silent mutations or 

20 those resulting in conservative amino acid substitutions would not generally be expected to 
disrupt protein function. Causal mutations can also be found in the promoter of the gene. These 
mutations would interfere with the binding of regulatory factors and in this way alter 
transcription of the gene and therefore change the function of the gene. 

In one aspect, the invention features probes and primers for use in a prognostic or 

25 diagnostic assay. For instance, the present invention also provides a probe/primer comprising a 
substantially purified oligonucleotide, which oligonucleotide comprises a region of nucleotide 
sequence that hybridizes under stringent conditions to at least approximately 12, preferably 25, 
more preferably 40, 50 or 75 consecutive nucleotides of sense or anti-sense sequence of AGT, 
including 5 ! and/or 3* untranslated regions. In preferred embodiments, the probe further 

30 comprises a label group attached thereto and able to be detected, e.g. the label group is selected 
from amongst radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

In a further aspect, the present invention features methods for determining whether a 
subject is at risk for developing diabetes mellitus. According to the diagnostic and prognostic 
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methods of the present invention, alteration of the wild-type AGT locus is detected. "Alteration 
of a wild-type gene" encompasses all forms of mutations including deletions, insertions and 
point mutations in the coding and noncoding regions. Deletions may be of the entire gene or of 
only a portion of the gene. Point mutations may result in stop codons, frameshift mutations or 
5 amino acid substitutions. Point mutations or deletions in the promoter can change transcription 
and thereby alter the gene function. Somatic mutations are those which occur only in certain 
tissues and are not inherited in the germline. Germline mutations can be found in any of a 
body's tissues and are inherited. The finding of AGT germline mutations thus provides 
diagnostic information. An AGT allele which is not deleted (e.g., found on the sister 
10 chromosome to a chromosome carrying an AGT deletion) can be screened for other mutations, 
such as insertions, small deletions, and point mutations. Point mutational events may occur in 
regulatory regions, such as in the promoter of the gene, or in intron regions or at intron/exon 
junctions. 

Useful diagnostic techniques include, but are not limited to fluorescent in situ 
15 hybridization (FISH), direct DNA sequencing, PFGE analysis, Southern blot analysis, single 
stranded conformation analysis (SSCA), RNase protection assay, allele-specific oligonucleotide 
(ASO), dot blot analysis and PCR-SSCP, as discussed in detail further below. Also useful is the 
recently developed technique of DNA microchip technology. In addition to the techniques 
described herein, similar and other useful techniques are also described in U.S. Patent Nos. 
20 5,837,492 and 5,800,998, each incorporated herein by reference. 

Predisposition to disease can be ascertained by testing any tissue of a human for 
mutations of the AGT gene. For example, a person who has inherited a germline AGT mutation 
would be prone to develop IDDM. This can be determined by testing DNA from any tissue of 
the person's body. Most simply, blood can be drawn and DNA extracted from the cells of the 
25 blood. In addition, prenatal diagnosis can be accomplished by testing fetal cells, placental cells 
or amniotic cells for mutations of the AGT gene. Alteration of a wild-type AGT allele, 
whether, for example, by point mutation or deletion, can be detected by any of the means 
discussed herein. 

There are several methods that can be used to detect DNA sequence variation. Direct 
30 DNA sequencing, either manual sequencing or automated fluorescent sequencing can detect 
sequence variation. For a gene as large as AGT, manual sequencing is very labor-intensive, but 
under optimal conditions, mutations in the coding sequence of a gene are rarely missed. 
Another approach is the single-stranded conformation polymorphism assay (SSCA) (Orita et ai, 
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1 989). This method does not detect all sequence changes, especially if the DNA fragment size 
is greater than 200 bp, but can be optimized to detect most DNA sequence variation. The 
reduced detection sensitivity is a disadvantage, but the increased throughput possible with SSCA 
makes it an attractive, viable alternative to direct sequencing for mutation detection on a 
research basis. The fragments which have shifted mobility on SSCA gels are then sequenced to 
determine the exact nature of the DNA sequence variation. Other approaches based on the 
detection of mismatches between the two complementary DNA strands include clamped 
denaturing gel electrophoresis (CDGE) (Sheffield et aL, 1991), heteroduplex analysis (HA) 
(White et aL 9 1992) and chemical mismatch cleavage (CMC) (Grompe et ai, 1989). None of the 
methods described above will detect large deletions, duplications or insertions, nor will they 
detect a regulatory mutation which affects transcription or translation of the protein. Other 
methods which might detect these classes of mutations such as a protein truncation assay or the 
asymmetric assay, detect only specific types of mutations and would not detect missense 
mutations. A review of currently available methods of detecting DNA sequence variation can be 
found in a recent review by Grompe (1993). Once a mutation is known, an allele specific 
detection approach such as allele specific oligonucleotide (ASO) hybridization can be utilized to 
rapidly screen large numbers of other samples for that same mutation. 

Detection of point mutations may be accomplished by molecular cloning of the AGT 
allele(s) and sequencing the allele(s) using techniques well known in the art. Alternatively, the 
gene sequences can be amplified directly from a genomic DNA preparation from the tissue, 
using known techniques. The DNA sequence of the amplified sequences can then be 
determined. 

There are six well known methods for a more complete, yet still indirect, test for 
confirming the presence of a susceptibility allele: 1) single-stranded conformation analysis 
(SSCA) (Orita et al. 9 1989); 2) denaturing gradient gel electrophoresis (DGGE) (Wartell et ai, 
1990; Sheffield et ai y 1989); 3) RNase protection assays (Finkelstein et al. 9 1990; Kinszler et 
al. 9 1991); 4) allele-specific oligonucleotides (ASOs) (Conner et al, 1983); 5) the use of 
proteins which recognize nucleotide mismatches, such as the E. coli mutS protein (Modrich, 
1991); and 6) allele-specific PCR (Rano and Kidd, 1989). For allele-specific PCR, primers are 
used which hybridize at their 3' ends to a particular AGT mutation. If the particular AGT 
mutation is not present, an amplification product is not observed. Amplification Refractory 
Mutation System (ARMS) can also be used, as disclosed in European Patent Application 
Publication No. 0332435 and in Newton et al. 9 1989. Insertions and deletions of genes can also 
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be detected by cloning, sequencing and amplification. In addition, restriction fragment length 
polymorphism (RFLP) probes for the gene or surrounding marker genes can be used to score 
alteration of an allele or an insertion in a polymorphic fragment. Such a method is particularly 
useful for screening relatives of an affected individual for the presence of the AGT mutation 
5 found in that individual. Other techniques for detecting insertions and deletions as known in the 
art can be used. 

In the first three methods (SSCA, DGGE and RNase protection assay), a new 
electrophoretic band appears. SSCA detects a band which migrates differentially because the 
sequence change causes a difference in single-strand, intramolecular base pairing. RNase 

10 protection involves cleavage of the mutant polynucleotide into two or more smaller fragments. 
DGGE detects differences in migration rates of mutant sequences compared to wild-type 
sequences, using a denaturing gradient gel. In an allele-specific oligonucleotide assay, an 
oligonucleotide is designed which detects a specific sequence, and the assay is performed by 
detecting the presence or absence of a hybridization signal. In the mutS assay, the protein binds 

15 only to sequences that contain a nucleotide mismatch in a heteroduplex between mutant and 
wild-type sequences. 

Mismatches, according to the present invention, are hybridized nucleic acid duplexes in 
which the two strands are not 100% complementary. Lack of total homology may be due to 
deletions, insertions, inversions or substitutions. Mismatch detection can be used to detect point 

20 mutations in the gene or in its mRNA product. While these techniques are less sensitive than 
sequencing, they are simpler to perform on a large number of tumor samples. An example of a 
mismatch cleavage technique is the RNase protection method. In the practice of the present 
invention, the method involves the use of a labeled riboprobe which is complementary to the 
human wild-type AGT gene coding sequence. The riboprobe and either mRNA or DNA isolated 

25 from the tumor tissue are annealed (hybridized) together and subsequently digested with the 
enzyme RNase A which is able to detect some mismatches in a duplex RNA structure. If a 
mismatch is detected by RNase A, it cleaves at the site of the mismatch. Thus, when the 
annealed RNA preparation is separated on an electrophoretic gel matrix, if a mismatch has been 
detected and cleaved by RNase A, an RNA product will be seen which is smaller than the full 

30 length duplex RNA for the riboprobe and the mRNA or DNA. The riboprobe need not be the 
full length of the AGT mRNA or gene but can be a segment of either. If the riboprobe comprises 
only a segment of the AGT mRNA or gene, it will be desirable to use a number of these probes 
to screen the whole mRNA sequence for mismatches. 
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In similar fashion, DNA probes can be used to detect mismatches, through enzymatic or 
chemical cleavage. See, e.g., Cotton et aL, 1988; Shenk et al, 1975; Novack et al, 1986. 
Alternatively, mismatches can be detected by shifts in the electrophoretic mobility of 
mismatched duplexes relative to matched duplexes. See, e.g., Cariello, 1988. With either 
5 riboprobes or DNA probes, the cellular mRNA or DNA which might contain a mutation can be 
amplified using PCR before hybridization. Changes in DNA of the AGT gene can also be 
detected using Southern hybridization, especially if the changes are gross rearrangements, such as 
deletions and insertions. 

DNA sequences of the AGT gene which have been amplified by use of PCR may also be 

10 screened using allele-specific probes. These probes are nucleic acid oligomers, each of which 
contains a region of the AGT gene sequence harboring a known mutation. For example, one 
oligomer may be about 30 nucleotides in length (although shorter and longer oligomers are also 
usable as well recognized by those of skill in the art), corresponding to a portion of the AGT 
gene sequence. By use of a battery of such allele-specific probes, PCR amplification products 

15 can be screened to identify the presence of a previously identified mutation in the AGT gene. 
Hybridization of allele-specific probes with amplified AGT sequences can be performed, for 
example, on a nylon filter. Hybridization to a particular probe under high stringency 
hybridization conditions indicates the presence of the same mutation in the tumor tissue as in the 
allele-specific probe. 

20 The newly developed technique of nucleic acid analysis via microchip technology is also 

applicable to the present invention. In this technique, literally thousands of distinct oligonucleotide 
probes are built up in an array on a silicon chip. Nucleic acid to be analyzed is fluorescently 
labeled and hybridized to the probes on the chip. It is also possible to study nucleic acid-protein 
interactions using these nucleic acid microchips. Using this technique one can determine the 

25 presence of mutations or even sequence the nucleic acid being analyzed or one can measure 
expression levels of a gene of interest. The method is one of parallel processing of many, even 
thousands, of probes at once and can tremendously increase the rate of analysis. Several papers 
have been published which use this technique. Some of these are Hacia et al., 1996; Shoemaker et 
al., 1996; Chee et al., 1996; Lockhart et al., 1996; DeRisi et al., 1996; Lipshutz et al., 1995. This 

30 method has already been used to screen people for mutations in the breast cancer gene BRCA1 
(Hacia et al., 1996). This new technology has been reviewed in a news article in Chemical and 
Engineering News (Borman, 1996) and been the subject of an editorial (Nature Genetics, 1996). 
Also see Fodor(1997). 
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The most definitive test for mutations in a candidate locus is to directly compare 
genomic AGT sequences from disease patients with those from a control population. 
Alternatively, one could sequence messenger RNA after amplification, e.g., by PCR, thereby 
eliminating the necessity of determining the exon structure of the candidate gene. 
5 Mutations from disease patients falling outside the coding region of AGT can be detected 

by examining the non-coding regions, such as introns and regulatory sequences near or within 
the AGT gene. An early indication that mutations in noncoding regions are important may come 
from Northern blot experiments that reveal messenger RNA molecules of abnormal size or 
abundance in disease patients as compared to control individuals. 

\ 0 Alteration of AGT mRNA expression can be detected by any techniques known in the 

art. These include Northern blot analysis, PCR amplification and RNase protection. Diminished 
or increased mRNA expression indicates an alteration of the wild-type AGT gene. Alteration of 
wild-type AGT genes can also be detected by screening for alteration of wild-type AGT protein. 
For example, monoclonal antibodies immunoreactive with AGT can be used to screen a tissue. 

1 5 Lack of cognate antigen would indicate an AGT mutation. Antibodies specific for products of 
mutant alleles could also be used to detect mutant AGT gene product. Such immunological 
assays can be done in any convenient formats known in the art. These include Western blots, 
immunohistochemical assays and ELISA assays. Any means for detecting an altered AGT 
protein can be used to detect alteration of wild-type AGT genes. Functional assays, such as 

20 protein binding determinations, can be used. In addition, assays can be used which detect AGT 
biochemical function. Finding a mutant AGT gene product indicates alteration of a wild-type 
AGT gene. 

The primer pairs of the present invention are useful for determination of the nucleotide 
sequence of a particular AGT allele using PCR. The pairs of single-stranded DNA primers can 

25 be annealed to sequences within or surrounding the AGT gene on chromosome 1 in order to 
prime amplifying DNA synthesis of the AGT gene itself. A complete set of these primers 
allows synthesis of all of the nucleotides of the AGT gene coding sequences, i.e., the exons. 
The set of primers preferably allows synthesis of both intron and exon sequences. Allele-specific 
primers can also be used. Such primers anneal only to particular AGT mutant alleles, and thus 

30 will only amplify a product in the presence of the mutant allele as a template. 

In order to facilitate subsequent cloning of amplified sequences, primers may have 
restriction enzyme site sequences appended to their 5* ends. Thus, all nucleotides of the primers 
are derived from AGT sequences or sequences adjacent to AGT, except for the few nucleotides 
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necessary to form a restriction enzyme site. Such enzymes and sites are well known in the art. 
The primers themselves can be synthesized using techniques which are well known in the art. 
Generally, the primers can be made using oligonucleotide synthesizing machines which are 
commercially available. Given the known sequences of the AGT exons and the 5' alternate 
5 exon, the design of particular primers is well within the skill of the art. Suitable primers for 
mutation screening are also described herein. 

The nucleic acid probes provided by the present invention are useful for a number of 
purposes. They can be used in Southern hybridization to genomic DNA and in the RNase 
protection method for detecting point mutations already discussed above. The probes can be 
10 used to detect PCR amplification products. They may also be used to detect mismatches with the 
AGT gene or mRNA using other techniques. 

It has been discovered that individuals with the wild-type AGT gene do not have IDDM 
which results from the AGT allele. However, mutations which interfere with the function of the 
AGT protein are involved in the susceptability to IDDM as shown herein. Thus, the presence of 
15 an altered (or a mutant) AGT gene which produces a protein having a loss of function, or altered 
function, directly correlates to an increased risk of disease. In order to detect an AGT gene 
mutation, a biological sample is prepared and analyzed for a difference between the sequence of 
the AGT allele being analyzed and the sequence of the wild-type AGT allele. Mutant AGT 
alleles can be initially identified by any of the techniques described above. The mutant alleles 
20 are then sequenced to identify the specific mutation of the particular mutant allele. 
Alternatively, mutant AGT alleles can be initially identified by identifying mutant (altered) AGT 
proteins, using conventional techniques. The mutant alleles are then sequenced to identify the 
specific mutation for each allele. The mutations, especially those which lead to an altered 
function of the AGT protein, are then used for the diagnostic methods of the present invention. 
25 The present invention employs definitions commonly used in the art with specific 

reference to the gene described in the present application. Such definitions can be found in U.S. 
Patent Nos. 5,837,492 and 5,800,998, each incorporated herein by reference. Such definitions 
are employed herein unless the context indicates otherwise. 
Methods of Use: Nucleic Acid Diagnosis and Diagnostic Kits 
30 In order to detect the presence of an AGT allele predisposing an individual to diabetes, a 

biological sample such as blood is prepared and analyzed for the presence or absence of 
predisposing alleles of AGT, Results of these tests and interpretive information are returned to 
the health care provider for communication to the tested individual. Such diagnoses may be 
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performed by diagnostic laboratories, or, alternatively, diagnostic kits are manufactured and sold 
to health care providers or to private individuals for self-diagnosis. Diagnositic or prognostic 
tests can be performed as described herein or using well known techniques, such as described in 
U.S. Patent No. 5,800,998, incorporated herein by reference. 
5 Initially, the screening method involves amplification of the relevant AGT sequences. In 

another preferred embodiment of the invention, the screening method involves a non-PCR based 
strategy. Such screening methods include two-step label amplification methodologies that are 
well known in the art. Both PCR and non-PCR based screening strategies can detect target 
sequences with a high level of sensitivity. 

1 0 The most popular method used today is target amplification. Here, the target nucleic acid 

sequence is amplified with polymerases. One particularly preferred method using polymerase- 
driven amplification is the polymerase chain reaction (PCR). The polymerase chain reaction and 
other polymerase-driven amplification assays can achieve over a million-fold increase in copy 
number through the use of polymerase-driven amplification cycles. Once amplified, the resulting 

1 5 nucleic acid can be sequenced or used as a substrate for DNA probes. 

When the probes are used to detect the presence of the target sequences (for example, in 
screening for diabetes susceptibility), the biological sample to be analyzed, such as blood or 
serum, may be treated, if desired, to extract the nucleic acids. The sample nucleic acid may be 
prepared in various ways to facilitate detection of the target sequence; e.g. denaturation, 

20 restriction digestion, electrophoresis or dot blotting. The targeted region of the analyte nucleic 
acid usually must be at least partially single-stranded to form hybrids with the targeting 
sequence of the probe. If the sequence is naturally single-stranded, denaturation will not be 
required. However, if the sequence is double-stranded, the sequence will probably need to be 
denatured. Denaturation can be carried out by various techniques known in the art. 

25 Analyte nucleic acid and probe are incubated under conditions which promote stable 

hybrid formation of the target sequence in the probe with the putative targeted sequence in the 
analyte. The region of the probes which is used to bind to the analyte can be made completely 
complementary to the targeted region of human chromosome 1. Therefore, high stringency 
conditions are desirable in order to prevent false positives. However, conditions of high 

30 stringency are used only if the probes are complementary to regions of the chromosome which 
are unique in the genome. The stringency of hybridization is determined by a number of factors 
during hybridization and during the washing procedure, including temperature, ionic strength, 
base composition, probe length, and concentration of formamide. These factors are outlined in, 
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for example, Maniatis <?/ ai, 1982 and Sambrook e/ a/., 1989. Under certain circumstances, the 
formation of higher order hybrids, such as triplexes, quadraplexes, etc., may be desired to 
provide the means of detecting target sequences. 

Detection, if any, of the resulting hybrid is usually accomplished by the use of labeled 
5 probes. Alternatively, the probe may be unlabeled, but may be detectable by specific binding 
with a ligand which is labeled, either directly or indirectly. Suitable labels, and methods for 
labeling probes and ligands are known in the art, and include, for example, radioactive labels 
which may be incorporated by known methods (e.g., nick translation, random priming or 
kinasing), biotin, fluorescent groups, chemiluminescent groups (e.g., dioxetanes, particularly 

10 triggered dioxetanes), enzymes, antibodies and the like. Variations of this basic scheme are 
known in the art, and include those variations that facilitate separation of the hybrids to be 
detected from extraneous materials and/or that amplify the signal from the labeled moiety. A 
number of these variations are reviewed in, e.g., Matthews and Kricka, 1988; Landegren et ai, 
1988; Mittlin, 1989; U.S. Patent 4,868,105, and in EPO Publication No. 225,807. 

15 As noted above, non-PCR based screening assays are also contemplated in this 

invention. This procedure hybridizes a nucleic acid probe (or an analog such as a methyl 
phosphonate backbone replacing the normal phosphodiester), to the low level DNA target. This 
probe may have an enzyme covalently linked to the probe, such that the covalent linkage does 
not interfere with the specificity of the hybridization. This enzyme-probe-conjugate-target 

20 nucleic acid complex can then be isolated away from the free probe enzyme conjugate and a 
substrate is added for enzyme detection. Enzymatic activity is observed as a change in color 
development or luminescent output resulting in a 10 3 -10 6 increase in sensitivity. For an example 
relating to the preparation of oligodeoxynucleotide-alkaline phosphatase conjugates and their 
use as hybridization probes see Jablonski et al. 9 1986. 

25 Two-step label amplification methodologies are known in the art. These assays work on 

the principle that a small ligand (such as digoxigenin, biotin, or the like) is attached to a nucleic 
acid probe capable of specifically binding AGT. Allele specific probes are also contemplated 
within the scope of this example and exemplary allele specific probes include probes 
encompassing the predisposing or potentially predisposing mutations summarized in herein. 

30 In one example, the small ligand attached to the nucleic acid probe is specifically 

recognized by an antibody-enzyme conjugate. In one embodiment of this example, digoxigenin 
is attached to the nucleic acid probe. Hybridization is detected by an antibody-alkaline 
phosphatase conjugate which turns over a chemiluminescent substrate. For methods for labeling 
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nucleic acid probes according to this embodiment see Martin et al, 1990. In a second example, 
the small ligand is recognized by a second ligand-enzyme conjugate that is capable of 
specifically complexing to the first ligand. A well known embodiment of this example is the 
biotin-avidin type of interactions. For methods for labeling nucleic acid probes and their use in 
5 biotin-avidin based assays see Rigby et al , 1 977 and Nguyen et al , 1 992. 

It is also contemplated within the scope of this invention that the nucleic acid probe 
assays of this invention will employ a cocktail of nucleic acid probes capable of detecting AGT. 
Thus, in one example to detect the presence of AGT in a cell sample, more than one probe 
complementary to AGT is employed and in particular the number of different probes is 

10 alternatively 2, 3, or 5 different nucleic acid probe sequences. In another example, to detect the 
presence of mutations in the AGT gene sequence in a patient, more than one probe 
complementary to AGT is employed where the cocktail includes probes capable of binding to the 
allele-specific mutations identified in populations of patients with alterations in AGT. In this 
embodiment, any number of probes can be used, and will preferably include probes 

15 corresponding to the major gene mutations identified as predisposing an individual to diabetes. 
Some candidate probes contemplated within the scope of the invention include probes that 
include the allele-specific mutations identified herein and those that have the AGT regions 
corresponding to SEQ ID NOs:l-5 and 8 both 5* and 3' to the mutation site. 
Methods of Use: Peptide Diagnosis and Diagnostic Kits 

20 ' Susceptibility to daibetes can also be detected on the basis of the alteration of wild-type 

AGT polypeptide. Peptide diagnositic or prognostic tests can be performed as described herein 
or using well known techniques, such as described in U.S. Patent No. 5,800,998, incorporated 
herein by reference. For example, such alterations can be determined -by sequence analysis in 
accordance with conventional techniques. More preferably, antibodies (polyclonal or 

25 monoclonal) are used to detect differences in, or the absence of, AGT peptides. The antibodies 
may be prepared in accordance with conventional techniques. Other techniques for raising and 
purifying antibodies are well known in the art and any such techniques may be chosen to 
achieve the preparations claimed in this invention. In a preferred embodiment of the invention, 
antibodies will immunoprecipitate AGT proteins or fragments of the AGT protein from solution 

30 as well as react with AGT peptides on Western or immunoblots of polyacrylamide gels. In 
another preferred embodiment, antibodies will detect AGT proteins and protein fragments in 
paraffin or frozen tissue sections, using immunocytochemical techniques. 
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Preferred embodiments relating to methods for detecting AGT or its mutations include 
enzyme linked immunosorbent assays (ELISA), radioimmunoassays (RIA), immunoradiometric 
assays (IRMA) and immunoenzymatic assays (IEMA), including sandwich assays using 
monoclonal and/or polyclonal antibodies. Exemplary sandwich assays are described by David et 
5 al in U.S. Patent Nos. 4,376,1 10 and 4,486,530, hereby incorporated by reference. 
Methods of Use: Drug Screening 

This invention is particularly useful for screening compounds by using a wild-type or 
mutant AGT polypeptide or binding fragment thereof in any of a variety of drug screening 
techniques. Drug screening can be performed as described herein or using well known 
10 techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, each incorporated 
herein by reference. 

The AGT polypeptide or fragment employed in such a test may either be free in solution, 
affixed to a solid support, or borne on a cell surface. One method of drug screening utilizes 
eucaryotic or procaryotic host cells which are stably transformed with recombinant 

15 polynucleotides expressing the polypeptide or fragment, preferably in competitive binding 
assays. Such cells, either in viable or fixed form, can be used for standard binding assays. One 
may measure, for example, for the formation of complexes between an AGT polypeptide or 
fragment and the agent being tested, or examine the degree to which the formation of a complex 
between an AGT polypeptide or fragment and a known ligand, e.g. AGT receptor (ATI), is 

20 interfered with by the agent being tested. 

Thus, the present invention provides methods of screening for drugs comprising 
contacting such an agent with an AGT polypeptide or fragment thereof and assaying (i) for the 
presence of a complex between the agent and the AGT polypeptide or fragment, or (ii) for the 
presence of a complex between the AGT polypeptide or fragment and a ligand, by methods well 

25 known in the art. In such competitive binding assays the AGT polypeptide or fragment is 
typically labeled. Free AGT polypeptide or fragment is separated from that present in a 
protein:protein complex, and the amount of free (i.e., uncomplexed) label is a measure of the 
binding of the agent being tested to AGT or its interference with AGT:ligand binding, 
respectively. 

30 Another technique for drug screening provides high throughput screening for compounds 

having suitable binding affinity to the AGT polypeptides and is described in detail in Geysen, 
PCT published application WO 84/03564, published on September 13, 1984. Briefly stated, 
large numbers of different small peptide test compounds are synthesized on a solid substrate, 
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such as plastic pins or some other surface. The peptide test compounds are reacted with AGT 
polypeptides and washed. Bound AGT polypeptides are then detected by methods well known 
in the art. 

Purified AGT can be coated directly onto plates for use in the aforementioned drug 
5 screening techniques. However, non-neutralizing antibodies to the polypeptide can be used to 
capture antibodies to immobilize the AGT polypeptide on the solid phase. 

This invention also contemplates the use of competitive drug screening assays in which 
neutralizing antibodies capable of specifically binding the AGT polypeptide compete with a test 
compound for binding to the AGT polypeptide or fragments thereof. In this manner, the 

10 antibodies can be used to detect the presence of any peptide which shares one or more antigenic 
determinants of the AGT polypeptide. 

A further technique for drug screening involves the use of host eukaryotic cell lines or 
cells (such as described above) which express a wild-type or mutant AGT gene and as a 
consequence of expression of wild type or mutant AGT demonstrate a specific phenotype.. The 

1 5 phenotype of the cells is examined to determine if the compound is capable of modulating the 
phenotype and thereby AGT function. 

Briefly, a method of screening for a substance which modulates activity of a polypeptide 
may include contacting one or more test substances with the polypeptide in a suitable reaction 
medium, testing the activity of the treated polypeptide and comparing that activity with the 

20 activity of the polypeptide in comparable reaction medium untreated with the test substance or 
substances. A difference in activity between the treated and untreated polypeptides is indicative 
of a modulating effect of the relevant test substance or substances. 

Prior to or as well as being screened for modulation of activity, test substances may be 
screened for ability to interact with the polypeptide, e.g., in a yeast two-hybrid system (e.g., 

25 Bartel et al., 1993; Fields and Song, 1989; Chevray and Nathans, 1992; Lee et al., 1995). This 
system may be used as a coarse screen prior to testing a substance for actual ability to modulate 
activity of the polypeptide. Alternatively, the screen could be used to screen test substances for 
binding to an AGT specific binding partner, or to find mimetics of an AGT polypeptide. 
Methods of Use: Rational Drug Design 

30 The goal of rational drug design is to produce structural analogs of biologically active 

polypeptides of interest or of small molecules with which they interact (e.g., agonists, 
antagonists, inhibitors) in order to fashion drugs which are, for example, more active or stable 
forms of the polypeptide, or which, e.g., enhance or interfere with the function of a polypeptide 
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in vivo. See, e.g., Hodgson, 1991. Rational drug design can be performed as described herein or 
using well known techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, 
each incorporated herein by reference. 

In one approach, one first determines the three-dimensional structure of a protein of 
interest (e.g., AGT polypeptide or fragments of the AGT polypeptide) or ? for example, of the 
AGT-receptor or ligand complex, by x-ray crystallography, by computer modeling or most 
typically, by a combination of approaches. Less often, useful information regarding the structure 
of a polypeptide may be gained by modeling based on the structure of homologous proteins. An 
example of rational drug design is the development of HIV protease inhibitors (Erickson et aL, 
1990). In addition, peptides (e.g., AGT polypeptide or fragments thereof) are analyzed by an 
alanine scan (Wells, 1991). In this technique, an amino acid residue is replaced by Ala, and its 
effect on the peptide's activity is determined. Each of the amino acid residues of the peptide is 
analyzed in this manner to determine the important regions of the peptide. 

It is also possible to isolate a target-specific antibody, selected by a functional assay, and 
then to solve its crystal structure. In principle, this approach yields a pharmacore upon which 
subsequent drug design can be based. It is possible to bypass protein crystallography altogether 
by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active 
antibody. As a mirror image of a mirror image, the binding site of the anti-ids would be 
expected to be an analog of the original receptor. The anti-id could then be used to identify and 
isolate peptides from banks of chemically or biologically produced banks of peptides. Selected 
peptides would then act as the pharmacore. Thus, one may design drugs which have, e.g., 
improved AGT polypeptide activity or stability or which act as inhibitors, agonists, antagonists, 
etc. of AGT polypeptide activity. 

Following identification of a substance which modulates or affects polypeptide activity, 
the substance may be investigated further. Furthermore, it may be manufactured and/or used in 
preparation, i.e., manufacture or formulation, or a composition such as a medicament, 
pharmaceutical composition or drug. These may be administered to individuals. 

Thus, the present invention extends in various aspects not only to a substance identified 
using a nucleic acid molecule as a modulator of polypeptide activity, in accordance with what is 
disclosed herein, but also a pharmaceutical composition, medicament, drug or other composition 
comprising such a substance, a method comprising administration of such a composition 
comprising such a substance, a method comprising administration of such a composition to a 
patient, e.g., for treatment or prophylaxis of diabetes, use of such a substance in the manufacture 
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of a composition for administration, e.g., for treatment or prophylaxis of diabetes, and a method 
of making a pharmaceutical composition comprising admixing such a substance with a 
pharmaceutical^ acceptable excipient, vehicle or carrier, and optionally other ingredients. 

A substance identified as a modulator of polypeptide function may be peptide or non- 
5 peptide in nature. Non-peptide "small molecules" are often preferred for many in vivo pharma- 
ceutical uses. Accordingly, a mimetic or mimic of the substance (particularly if a peptide) may 
be designed for pharmaceutical use. 

The designing of mimetics to a known pharmaceutically active compound is a known 
approach to the development of pharmaceuticals based on a "lead" compound. This might be 

10 desirable where the active compound is difficult or expensive to synthesize or where it is 
unsuitable for a particular method of administration, e.g., pure peptides are unsuitable active 
agents for oral compositions as they tend to be quickly degraded by proteases in the alimentary 
canal. Mimetic design, synthesis and testing is generally used to avoid randomly screening large 
numbers of molecules for a target property. 

15 There are several steps commonly taken in the design of a mimetic from a compound 

having a given target property. First, the particular parts of the compound that are critical and/or 
important in determining the target property are determined. In the case of a peptide, this can be 
done by systematically varying the amino acid residues in the peptide, e.g., by substituting each 
residue in turn. Alanine scans of peptide are commonly used to refine such peptide motifs. 

20 These parts or residues constituting the active region of the compound are known as its 
"pharmacophore". 

Once the pharmacophore has been found, its structure is modeled according to its 
physical properties, e.g., stereochemistry, bonding, size and/or charge, using data from a range 
of sources, e.g., spectroscopic techniques, x-ray diffraction data and NMR. Computational 
25 analysis, similarity mapping (which models the charge and/or volume of a pharmacophore, 
rather than the bonding between atoms) and other techniques can be used in this modeling 
process. 

In a variant of this approach, the three-dimensional structure of the ligand and its binding 
partner are modeled. This can be especially useful where the ligand and/or binding partner 
30 change conformation on binding, allowing the model to take account of this in the design of the 
mimetic. 

A template molecule is then selected onto which chemical groups which mimic the 
pharmacophore can be grafted. The template molecule and the chemical groups grafted onto it 
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can conveniently be selected so that the mimetic is easy to synthesize, is likely to be 
pharmacologically acceptable, and does not degrade in vivo, while retaining the biological 
activity of the lead compound. Alternatively, where the mimetic is peptide-based, further 
stability can be achieved by cyclizing the peptide, increasing its rigidity. The mimetic or 
5 mimetics found by this approach can then be screened to see whether they have the target 
property, or to what extent they exhibit it. Further optimization or modification can then be 
carried out to arrive at one or more final mimetics for in vivo or clinical testing. 
Methods of Use: Nucleic Acid Based Therapies 

According to the present invention, a method is also provided of supplying wild-type 

10 AGT function to a cell which carries mutant AGT alleles. The wild-type AGT gene or a part of 
the gene may be introduced into the cell in ,a vector such that the gene remains 
extrachromosomal. In such a situation, the gene will be expressed by the cell from the 
extrachromosomal location. If a gene fragment is introduced and expressed in a cell carrying a 
mutant AGT allele, the gene fragment should encode a part of the AGT protein which is 

15 required for normal physiological processes of the cell. More preferred is the situation where 
the wild-type AGT gene or a part thereof is introduced into the mutant cell in such a way that it 
recombines with the endogenous mutant AGT gene present in the cell. Such recombination 
requires a double recombination event which results in the correction of the AGT gene mutation. 
Vectors for introduction of genes both for recombination and for extrachromosomal 

20 maintenance are known in the art, and any suitable vector may be used. Methods for 
introducing DNA into cells such as electroporation, calcium phosphate coprecipitation and viral 
transduction are known in the art, and the choice of method is within the competence of the 
routineer. See also U.S. Patent Nos. 5,800,998 and 5,891,628, each incorporated by reference 
herein. 

25 Among the compounds which may exhibit anti-diabetes activity are antisense, ribozyme, 

and triple helix molecules. Such molecules may be designed to reduce or inhibit mutant AGT 
activity. Techniques for the production and use of such molecules are well known to those of 
skill in the art, such as described herein or in U.S. Patent No. 5,800,998, incorporated herein by 
reference. 

30 Antisense RNA and DNA molecules act to directly block the translation of mRNA by 

binding to targeted mRNA and preventing protein translation. With respect to antisense DNA, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g., between the -10 and 
+10 regions of the AGT nucleotide sequence of interest, are preferred. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by a endonucleolytic cleavage. The 
composition of ribozyme molecules must include one or more sequences complementary to the 
5 target AGT mRNA, preferably the mutant AGT mRNA, and must include the well known 
catalytic sequence responsible for mRNA cleavage. For this sequence, see U.S. Pat. No. 
5,093,246, which is incorporated by reference herein in its entirety. As such, within the scope of 
the invention are engineered hammerhead motif ribozyme molecules that specifically and 
efficiently catalyze endonucleolytic cleavage of RNA sequences encoding AGT, preferably 

10 mutant AGT proteins. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified 
by scanning the target molecule for ribozyme cleavage sites which include the following 
sequence: GUA, GUU and GUC. Once identified, short RNA sequences of between 15 and 
ribonucleotides corresponding to the region of the target gene containing the cleavage site may 

15 be evaluated for predicted structural features, such as secondary structure, that may render the 
oligonucleotide sequence unsuitable. The suitability of candidate targets may also be evaluated 
by testing their accessibility to hybridization with complementary oligonucleotides, using 
ribonuclease protection assays. 

Nucleic acid molecules to be used in triplex helix formation should be single stranded 

20 and composed of deoxynucleotides. The base composition of these oligonucleotides must be 
designed to promote triple helix formation via Hoogsteen base pairing rules, which generally 
require sizeable stretches of either purines or pyrimidines to be present on one strand of a 
duplex. Nucleotide sequences may be pyrimidine-based, which will result in TAT and 
CGC.sup.+ triplets across the three associated strands of the resulting triple helix. The 

25 pyrimidine-rich molecules provide base complementarity to a purine-rich region of a single 
strand of the duplex in a parallel orientation to that strand. In addition, nucleic acid molrcules 
may be chosen that are purine-rich, for example, contain a stretch of guanidine residues. These 
molecules will form a triple helix with a DNA duplex that is rich in GC pairs, in which the 
majority of the purine residues are located on a single strand of the targeted duplex, resulting in 

30 GGC triplets across the three strands in the triplex. 

Alternatively, the potential sequences that can be targeted for triple helix formation may 
be increased by creating a so called "switchback" nucleic acid molecule. Switchback molecules 
are synthesized in an alternating 5'-3\ 3'-5 f manner, such that they base pair with one strand of a 
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duplex first and then the other, eliminating the necessity for a sizeable stretch of either purines 
or pyrimidines to be present on one strand of a duplex. 

It is possible that the antisense, ribozyme, and/or triple helix molecules described herein 
may reduce or inhibit the translation of mRNA produced by both normal and mutant AGT 
5 alleles. In order to ensure that substantial normal levels of AGT activity are maintained in the 
cell, nucleic acid molecules that encode and express AGT polypeptides exhibiting normal AGT 
activity may be introduced into cells which do not contain sequences susceptible to whatever 
antisense, ribozyme, or triple helix treatments. Such sequences may be introduced via gene 
therapy methods. Alternatively, it may be preferable to coadminister normal AGT protein into 

10 the cell or tissue in order to maintain the requisite level of cellular or tissue AGT activity. 

Antisense RNA and DNA molecules, ribozyme molecules and triple helix molecules of 
the invention may be prepared by any method known in the art for the synthesis of DNA and 
RNA molecules. These include techniques for chemically synthesizing 
oligodeoxyribonucleotides and oligoribonucleotides well known in the art such as for example 

15 solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA sequences encoding the antisense RNA 
molecule. Such DNA sequences may be incorporated into a wide variety of vectors which 
incorporate suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 
Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or 

20 inducibly, depending on the promoter used, can be introduced stably into cell lines. 

Various well-known modifications to the DNA molecules may be introduced as a means 
of increasing intracellular stability and half-life. Possible modifications include, but are not 
limited to, the addition of flanking sequences of ribo- or deoxy- nucleotides to the 5' and/or 3' 
ends of the molecule or the use of phosphorothioate or 2* O-methyl rather than 

25 phosphodiesterase linkages within the oligodeoxyribonucleotide backbone. 

Gene therapy would be carried out according to generally accepted methods, for 
example, as described in further detail in U.S. Patent Nos. 5,837,492 and 5,800,998 and 
references cited therein, all incorporated by reference herein. Expression vectors in the context 
of gene therapy are meant to include those constructs containing sequences sufficient to express 

30 a polynucleotide that has been cloned therein. In viral expression vectors, the construct contains 
viral sequences sufficient to support packaging of the construct. If the polynucleotide encodes 
an antisense polynucleotide or a ribozyme, expression will produce the antisense polynucleotide 
or ribozyme. Thus in this context, expression does not require that a protein product be 
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synthesized. In addition to the polynucleotide cloned into the expression vector, the vector also 
contains a promoter functional in eukaryotic cells. The cloned polynucleotide sequence is under 
control of this promoter. Suitable eukaryotic promoters include those described above. The 
expression vector may also include sequences, such as selectable markers and other sequences 
conventionally used. 
Methods of Use: Peptide Therapy 

Peptides which have AGT activity can be supplied to cells which carry mutant or 
missing AGT alleles. Peptide therapy is performed as described herein or using well known 
techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, each incorporated 
herein by reference. 

Protein can be produced by expression of the cDNA sequence in bacteria, for example, 
using known expression vectors. Alternatively, AGT polypeptide can be extracted from AGT- 
producing mammalian cells. In addition, the techniques of synthetic chemistry can be employed 
to synthesize AGT protein. Any of such techniques can provide the preparation of the present 
invention which comprises the AGT protein. Preparation is substantially free of other human 
proteins. This is most readily accomplished by synthesis in a microorganism or in vitro. 

Active AGT molecules can be introduced into cells by microinjection or by use of 
liposomes, for example. Alternatively, some active molecules may be taken up by cells, actively 
or by diffusion. Extracellular application of the AGT gene product may be sufficient to affect the 
development and or progression of diabetes.. Supply of molecules with AGT activity should 
lead to partial reversal of thediabetic phenotype. Other molecules with AGT activity (for 
example, peptides, drugs or organic compounds) may also be used to effect such a reversal. 
Modified polypeptides having substantially similar function are also used for peptide therapy. 

Alternatively, antibodies that are both specific for mutant AGT gene product and 
interfere with its activity may be used. Such antibodies may be generated using standard 
techniques described herein or using conventional techniques, such as described in U.S. Patent 
Nos. 5,837,492; 5,800,998 and 5,891,628, against the proteins themselves or against peptides 
corresponding to the binding domains of the proteins. Such antibodies include but are not 
limited to polyclonal, monoclonal, Fab fragments, F(ab').sub.2 fragments, single chain 
antibodies, chimeric antibodies, humanized antibodies etc. 
Methods of Use: Agonists or Antagonists for Diabetes Preven tion or Treatment 

Mutations in angiotensinogen predisposing to diabetes suggest that the protein or peptide 
fragments of the protein have roles in the development or continued functioning of the beta islet 
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cells of the pancreas. It is known from the literature that mutation of Cysl8 leads to a reduction 
of the cleavage of angiotensinogen by renin (Rosa et al., 1998). However, there are a series of 
experiments which would be necessary in order to address angiotensinogen' s role in the disease 
and to identify points for pharmacological intervention. 
5 The only known peptide derived by cleavage of angiotensinogen with renin is 

angiotensin 1. Angiotensin 1 (AI) is then subsequently cleaved by angiotensin converting 
enzyme (ACE) to angiotensin 2 (All) which is, itself, metabolized to angiotensin 3 (AIII) and 
smaller peptides. Two human receptors for angiotensin 2, ATI and AT2, are known in the 
literature, but it has been reported that other receptors for these active metabolites exist (for 

10 example, Kozlowski et al., 1993), and can be molecularly cloned. These receptors may have a 
biological distribution that includes the pancreas and/or be selective for one or more of the 
angiotensin 1 metabolites. Such receptors represent good targets for pharmacological 
intervention in diabetes. Equally, agonists, for example AI, All, AIII or synthetic derivatives 
such as those described by Samanen et al. (1991), or antagonists of the known angiotensin 

15 receptors (e.g., losartan) represent drugs for use in the treatment or prevention of diabetes. After 
the initial cleavage by renin a large polypetide remains. The breakdown of this polypeptide into 
other active fragments can be envisioned. Agonist and antagonists of the receptors for these 
fragments as well as the proteases responsible for there formation also represent drugs for the 
treatment or prevention of diabetes. 

20 Methods of Use: Identification of Functional Peptides Derived From AGT 

Nothing, in current literature, is known of the metabolic fate of the remainder of 
angiotensinogen once angiotensin 1 is cleaved off by renin. It is possible that this protein has a 
direct biological function or be cleaved to bio-active peptides. The existence of such peptides 
can be confirmed by using a panel of polyclonal antibodies which bind to angiotensingen and 

25 use these antibodies to look for immuno-reactive peptide fragments in human or other animal 
sera. Once such peptides are identified, their cognate receptors can be cloned using standard 
molecular biology approaches. Another approach would be to screen for receptor partners for 
random fragments of angiotensinogen using the yeast two hybrid system or other techniques to 
discover protein/protein interactions. The peptides, identified by any approach, may, 

30 themselves, have utility as drugs for the treatment of diabetes or their receptors may represent 
points for pharmacological intervention in diabetes, in addition the inhibitors or activators of 
the protease (s) responsible for the cleavage of the full length AGT protein represnet another 
opportunity to develop drugs to treat or prevent diabetes. 
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Methods of Use: Transformed Hosts: Transeenic/Knockout Animals and Models 

Similarly, cells and animals which carry a mutant AGT allele can be used as model 
systems to study and test for substances which have potential as therapeutic agents. These may 
be isolated from individuals with AGT mutations, either somatic or germline. Alternatively, the 
5 cell line can be engineered to carry the mutation in the AGT allele, as described above. After a 
test substance is applied to the cells, the phenotype of the cell is determined. Any trait of the 
transformed cells can be assessed using techniques well known in the art. Transformed hosts, 
transgenic/knockout animals and models are prepared and used as described herein or using well 
known techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, each 

10 incorporated herein by reference. 

Animals for testing therapeutic agents can be selected after mutagenesis of whole 
animals or after treatment of germline cells or zygotes. Such treatments include insertion of 
mutant AGT alleles, usually from a second animal species, as well as insertion of disrupted 
homologous genes. Alternatively, the endogenous AGT gene(s) of the animals may be disrupted 

15 by insertion or deletion mutation or other genetic alterations using conventional techniques 
(Capecchi, 1989; Valancius and Smithies, 1991; Hasty et aL 9 1991; Shinkai et al. 9 1992; 
Mombaerts et aL, 1992; Philpott et ai 9 1992; Snouwaert et al y 1992; Donehower et al y 1992) to 
produce knockout or transplacement animals. A transplacement is similar to a knockout because 
the endogenous gene is replaced, but in the case of a transplacement the replacement is by 

20 another version of the same gene. After test substances have been administered to the animals, 
the the diabetic phenotypemust be assessed. If the test substance prevents or suppresses thethe 
diabetic phenotype, then the test substance is a candidate therapeutic agent for the treatment of 
diabetes.. These animal models provide an extremely important testing vehicle for potential 
therapeutic products. 

25 In one embodiment of the invention, transgenic animals are produced which contain a 

functional transgene encoding a functional AGT polypeptide or variants thereof. Transgenic 
animals expressing AGT transgenes, recombinant cell lines derived from such animals and 
transgenic embryos may be useful in methods for screening for and identifying agents that 
induce or repress function of AGT. Transgenic animals of the present invention also can be used 

30 as models for studying indications such as IDDM. 

In one embodiment of the invention, a AGT transgene is introduced into a non-human 
host to produce a transgenic animal expressing a human, murine or other species AGT gene. 
The transgenic animal is produced by the integration of the transgene into the genome in a 
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manner that permits the expression of the transgene. Methods for producing transgenic animals 
are generally described by Wagner and Hoppe (U.S. Patent No. 4,873,191; which is 
incorporated herein by reference), Brinster et al 1985; which is incorporated herein by reference 
in its entirety) and in "Manipulating the Mouse Embryo; A Laboratory Manual" 2nd edition 
5 (eds., Hogan, Beddington, Costantimi and Long, Cold Spring Harbor Laboratory Press, 1994; 
which is incorporated herein by reference in its entirety). 

It may be desirable to replace the endogenous AGT by homologous recombination 
between the transgene or a mutant gene and the endogenous gene; or the endogenous gene may 
be eliminated by deletion as in the preparation of "knock-out" animals. Typically, a AGT gene 

10 flanked by genomic sequences is transferred by microinjection into a fertilized egg. The 
microinjected eggs are implanted into a host female, and the progeny are screened for the 
expression of the transgene. Transgenic animals may be produced from the fertilized eggs from 
a number of animals including, but not limited to reptiles, amphibians, birds, mammals, and fish. 
Within a particularly preferred embodiment, transgenic mice are generated which overexpress 

15 AGT or express a mutant form of the polypeptide. Alternatively, the absence of a AGT in 
"knock-out" mice permits the study of the effects that loss of AGT protein has on a cell in vivo. 
Knock-out mice also provide a model for the development of AGT-related diabetes. 

Methods for producing knockout animals are generally described by Shastry (1995, 
1998) and Osterrieder and Wolf (1998). The production of conditional knockout animals, in 

20 which the gene is active until knocked out at the desired time is generally described by Feil et al. 
(1996), Gagneten et al. (1997) and Lobe and Nagy (1998). Each of these references is 
incorporated herein by reference. 

As noted above, transgenic animals and cell lines derived from such animals may find 
use in certain testing experiments. In this regard, transgenic animals and cell lines capable of 

25 expressing wild-type or mutant AGT may be exposed to test substances. These test substances 
can be screened for the ability to alter expression of wild-type AGT or alter the expression or 
function of mutant AGT. 

Pharmaceutical Compositions and Routes of Administration 

The AGT polypeptides, antibodies, peptides and nucleic acids of the present invention 
30 can be formulated in pharmaceutical compositions, which are prepared according to conventional 
pharmaceutical compounding techniques. See, for example, Remington's Pharmaceutical 
Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, PA). The composition may contain the 
active agent or pharmaceutical^ acceptable salts of the active agent. These compositions may 
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comprise, in addition to one of the active substances, a pharmaceutical^ acceptable excipient, 
carrier, buffer, stabilizer or other materials well known in the art. Such materials should be non- 
toxic and should not interfere with the efficacy of the active ingredient. The carrier may take a 
wide variety of forms depending on the form of preparation desired for administration, e.g., 
5 intravenous, oral, intrathecal, epineural or parenteral. 

For oral administration, the compounds can be formulated into solid or liquid 
preparations such as capsules, pills, tablets, lozenges, melts, powders, suspensions or emulsions. 
In preparing the compositions in oral dosage form, any of the usual pharmaceutical media may 
be employed, such as, for example, water, glycols, oils, alcohols, flavoring agents, preservatives, 
10 coloring agents, suspending agents, and the like in the case of oral liquid preparations (such as, 
for example, suspensions, elixirs and solutions); or carriers such as starches, sugars, diluents, 
granulating agents, lubricants, binders, disintegrating agents and the like in the case of oral solid 
preparations (such as, for example, powders, capsules and tablets). Because of their ease in 
administration, tablets and capsules represent the most advantageous oral dosage unit form, in 
15 which case solid pharmaceutical carriers are obviously employed. If desired, tablets may be 
sugar-coated or enteric-coated by standard techniques. The active agent can be encapsulated to 
make it stable to passage through the gastrointestinal tract while at the same time allowing for 
passage across the blood brain barrier. See for example, WO 96/1 1 698. 

For parenteral administration, the compound may be dissolved in a pharmaceutical 
20 carrier and administered as either a solution or a suspension. Illustrative of suitable carriers are 
water, saline, dextrose solutions, fructose solutions, ethanol, or oils of animal, vegetative or 
synthetic origin. The carrier may also contain other ingredients, for example, preservatives, 
suspending agents, solubilizing agents, buffers and the like. When the compounds are being 
administered intrathecally, they may also be dissolved in cerebrospinal fluid. 
25 The active agent is preferably administered in a therapeutically effective amount. The 

actual amount administered, and the rate and time-course of administration, will depend on the 
nature and severity of the condition being treated. Prescription of treatment, e.g. decisions on 
dosage, timing, etc., is within the responsibility of general practitioners or specialists, and 
typically takes account of the disorder to be treated, the condition of the individual patient, the 
30 site of delivery, the method of administration and other factors known to practitioners. 
Examples of techniques and protocols can be found in Remington 's Pharmaceutical Sciences. 

Alternatively, targeting therapies may be used to deliver the active agent more 
specifically to certain types of cell, by the use of targeting systems such as antibodies or cell 
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specific ligands. Targeting may be desirable for a variety of reasons, e.g. if the agent is 
unacceptably toxic, or if it would otherwise require too high a dosage, or if it would not 
otherwise be able to enter the target cells. 

Instead of administering these agents directly, they could be produced in the target cell, 
5 e.g. in a viral vector such as described above or in a cell based delivery system such as described 
in U.S. Patent No. 5,550,050 and published PCT application Nos. WO 92/19195, WO 94/25503, 
WO 95/01203, WO 95/05452, WO 96/02286, WO 96/02646, WO 96/40871, WO 96/40959 and 
WO 97/12635, designed for implantation in a patient. The vector could be targeted to the 
specific cells to be treated, or it could contain regulatory elements which are more tissue specific 

10 to the target cells. The cell based delivery system is designed to be implanted in a patient's body 
at the desired target site and contains a coding sequence for the active agent. Alternatively, the 
agent could be administered in a precursor form for conversion to the active form by an 
activating agent produced in, or targeted to, the cells to be treated. See for example, EP 
425,731 A and WO 90/07936. 

15 The practice of the present invention employs, unless otherwise indicated, conventional 

techniques of chemistry, molecular biology, microbiology, recombinant DNA, genetics, 
immunology, cell biology, cell culture and transgenic biology, which are within the skill of the 
art. See, e.g., Maniatis et aL, 1982; Sambrook et al, 1989; Ausubel et al, 1992; Glover, 1985; 
Anand, 1992; Guthrie and. Fink, 1991; Harlow and Lane, 1988; Jakoby and Pastan, 1979; 

20. Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And 
Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, 
Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A 
Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic 
Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos 

25 eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu et 
al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., 
Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV (D. M. 
Weir and C. C. Blackwell, eds., 1986); Hogan et al., Manipulating the Mouse Embryo, (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986).. A general discussion of 

30 techniques and materials for human gene mapping, including mapping of human chromosome 1, 
is provided, e.g., in White and Lalouel, 1988. 
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EXAMPLES 

The present invention is described by reference to the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. 
Standard techniques well known in the art or the techniques specifically described below were 
5 utilized. 

EXAMPLE 1 
Methods 

Subjects, genotyping. and hanlotvpe analyses. The families used in this study are part of 

10 a collection of IDDM families at the Human Biological Data Interchange (HBDI), Philadelphia, 
PA. Genomic DNA samples were ordered from the HBDI Repository located at the Coriell 
Institute for Medical Research in Camden, NJ. In the initial part of this study samples were 
ordered for individuals from 38 nuclear families with 3 siblings affected with IDDM. After 
identification of sequence variants in this first set of families, an additional 308 unrelated 

15 individuals with IDDM were obtained from HBDI. The initial control set of 120 samples were 
the grandparents of the Utah families included in the Centre d'Etude du Polymorphisme Humain 
(CEPH) database. Unrelated spouses from other Utah families were used as additional controls. 
No information on the presence or absence of diabetes in these samples was available. 
Genotypes using fluorescently labelled primers were determined for 1 1 simple tandem repeat 

20 polymorphisms spanning the published IDDM linkage on chromosome lq (Concannon et al., 
1998). Marker positions and primer sequences were obtained from Genethon (D1S237, D1S469, 
D1S439, D1S479, D1S437, D1S225, D1S251, D1S1568, AFMa072xa5xl\ Marshfield 
(D1S320), and CHLC (D1S1621). Haplotype analysis was performed by using the program 
MCLINK (Thomas et al., 1999). Comparisons of observed frequencies of mutation in cases and 

25 controls were made with a chi-squared, log likelihood ratio, test. The null hypothesis is that the 
observations come from Binomial distributions with the same probability parameters. The p- 
value quoted for each comparison is that for rejection of the null hypothesis in favor of the 
alternative hypothesis that the Binomials have unequal probability parameters. 

Genomic sequencing and mutation detection. A human genomic BAC library (Research 

30 Genetics) was screened by PCR for clones containing the marker WI 18298. The end sequences 
of BAC 1-30 were used as probes to assemble a BAC contig. 6x genomic sequencing coverage 
of BACs 7-30, 1-104 and 1-112 was obtained by sequencing of plasmid libraries constructed 
with randomly sheared and Saw3A-digested BAC DNA. For mutation detection exons were 
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amplified from genomic DNA with primers placed in intronic sequences flanking each exon. 
Nested amplifications were performed with primers containing Ml 3 forward (5'- 
GTTTTCCC AGTC ACG ACG-3 ' ; SEQ ID NO:9) and reverse (5 ' - AGG AAAC AGCTATGAC 
CAT-3 5 ; SEQ ID NO: 10) universal primer sequences at the 5' termini. Nested PGR products 
5 were cycle sequenced directly and run on ABI 377 automated sequencers (Perkin-Elmer). 
Sequence data were analyzed for polymorphisms using software developed at Myriad Genetics, 
Inc. 

GenBank accession numbers. Human AGT retina EST, AA059356; human AGT 
M24689; rat AGT L00094; sheep AGT D17520; mouse AGT AF045887; gorilla AGT 188488; 
10 chimpanzee AGT 1 88487. The sequences of AGT for gorilla and chimpanzee were generated in 
this study. 

EXAMPLE 2 
Association of AGT Gene and IDDM 

15 To investigate whether mutations in AGT are responsible for the reported linkage of type 

1 diabetes to the chromosomal region lq42-43, we carried out a three stage analysis. Initially 
we obtained a set of 38 nuclear families, each with three children affected with type 1 diabetes 
from the Human Biological Data Interchange. These were screened to identify potentially 
deleterious mutations. We then screened independent sets of 308 unrelated cases and 1003 

20 controls to determine whether the identified mutations were more common in cases than 
controls. Finally we screened all 308 cases and 631 of the 1003 controls to determine whether 
rare polymorphisms, potential causal mutations, were more common in cases than controls. In 
order to select from the 38 initial families those that could potentially segregate a mutation 
among all affected members, each family was genotyped using 11 microsatellite markers 

25 spanning a 32 cM region surrounding the AGT locus on chromosome 1 (Fig. 1). This revealed 
that in 20 of the 38 families the affected individuals shared a haplotype in part of the 32 cM 
region. 

The entire coding sequence, intron-exon boundaries, and 5' and 3' untranslated regions 
of A GT were screened for sequence variants in the genomic DNA of one affected member from 
30 each of the 20 nuclear families. Genomic DNA was amplified using primers positioned in 
introns flanking each exon of AGT (Table 1). Three families each contained a unique 
polymorphism in AGT, which we later determined segregated into all affected individuals in the 
family. 
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In family 24, a T to C transition was found in exon 2 at nucleotide 190 in the cDNA, 
which resulted in a substitution of arginine for cysteine at position 18 in the angiotensinogen 
mature peptide (Fig. 2a; Fig. 3a; 3b). Cysl8 is one of two conserved cysteine residues in 
mammalian angiotensinogen(s) (Fig. 3) that has been shown in vitro to form a disulfide bond 
5 with Cysl38 (Gimenez-Roqueplo et al., 1998a; Streatfeild- James et al., 1998). Disruption of 
disulfide bond formation might affect an unknown role for AGT in diabetes. Although it seems 
likely that disulfide bond formation contributes to a structurally constrained N-terminus 
(Streatfeild-James et al., 1998), disruption of the disulfide bond by in vitro mutagenesis of 
Cysl8 did not affect the rate of angiotensinogen cleavage by renin (Gimenez-Roqueplo et al., 

10 1998a). However, under physiological conditions, alteration of the protein's structure might 
disrupt association with other binding partners that would then indirectly affect cleavage by 
renin (Inoue et al., 1997). It is also possible that mutation of Cysl8 indirectly affects 
glycosylation of angiotensinogen. The partner with which Cysl8 forms a disulfide bond, 
Cysl38, lies in the middle of the N-glycosylation site Asnl37-Cysl38-Thrl39 (Fig. 3b). Of the 

15 four N-glycosylation sites in angiotensinogen, Asnl37 is the least glycosylated (Gimenez- 
Roqueplo et al., 1998b). In the absence of disulfide bond formation at Cysl38, Asnl37 might 
be glycosylated more frequently. Mutations in AGT that lead to increased glycosylation affect 
the amount of angiotensinogen secreted, and therefore indirectly the amount of Ang I (the 
peptide cleaved from AGT by renin and further converted to Ang II) produced (Gimenez- 

20 Roqueplo et al., 1996). Consequently, Cysl8Arg would cause a decrease in Ang I formation. 

Mutations detected in family 19 and family 30 were both in a 5' untranslated region of 
AGT. An alternative 5' exon of ^4GThas been reported in the EST database. This alternate 
form was verified by PCR and sequencing in this study. We determined from our BAG contig 
that the alternative exon lies at least lOOkb upstream of the published exon I (Fig. 1, Fig. 3), and 

25 it was only detected by PCR in retina cDNA. The alternative exon that splices into exon II does 
not contain an open reading frame that extends out of the exon. Translation of the alternative 
transcript is predicted to begin at the first methionine in exon II (Fig. 3 a) as previously reported 
(Gaillard et al., 1989). Although the alternative transcript produces the same angiotensinogen 
polypeptide as the published AGT cDNA, its expression could be regulated in a tissue-specific 

30 manner. It is not known what effect variants in the 5' untranslated region would have on the 
expression or stability of the transcript. A C to T transition at nucleotide 96 in the exon was 
seen in all affected individuals in family 19 (Fig. 2b; Fig. 3a). In family 30, a T to C transition 
at nucleotide 286 in this exon segregated with the disease (Fig. 2c; Fig. 3a). 
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Family 30 also contained a C to T transition at nucleotide 928 in exon 3, resulting in a 
non-conservative substitution of serine for proline at position 264 (Fig. 3a, 3c). This position is 
a proline in hominoids and a hydrophobic amino acid in other non-primate species (Fig. 3c); 
therefore the substitution of a hydrophilic amino acid (serine) is expected to have profound 
5 structural consequences. The Pro264Ser missense mutation is linked to a silent G to A transition 
at the preceding nucleotide, as well as to a C to T transition at -81 1 in the upstream genomic 
sequence (shown in SEQ ID NO:l). This haplotype of three linked polymorphisms (C-811T, 
G927A and Pro264Ser) did not segregate into all the affected individuals in family 30 (Fig. 2d, 
2c). However, this is not unexpected given that type 1 diabetes is a complex trait, and that 
10 predisposition to the disease probably involves a combination of mutations in multiple genes. If 
carrying a subset of these mutated genes results in the diabetic phenotype, members of the same 
family that do not share at the AGT locus could have an overlapping set of mutations in other 
gene(s). 

15 EXAMPLE 3 

Identification of AdditionalPolvmorphisms 
If the AGT polymorphisms identified in Example B are truly predisposing mutations, 
they should be detected in other type 1 diabetic individuals and should be rare or absent in 
controls. To identify additional mutations, a set of DNA samples from 308 unrelated type 1 

20 diabetic individuals was screened for sequence variants in AGT. These cases were selected 
from 308 additional families in the Human Biological Data Interchange. A set of control 
samples was screened in the same way for statistical comparison (see Example 1 and Table 2). 
The Cysl8Arg mutation was observed three times in the new case set. It segregated with the 
disease in family 164, but not in families 214 and 344 (Table 2). The Cysl8Arg mutation was 

25 observed in only 1 of 1003 control samples, indicating that it is enriched in the disease set (3 of 
308 vs. 1 of 1003,/? =0.03). 
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In the new case set the Pro264Ser mutation was seen once, with its accompanying 
polymorphisms, but was not seen in the controls (1/308 vs. 0/1003, p =0.09). The C-811T 
polymorphism was observed once in the controls but in isolation, not as part of the C-811T, 
G927A and Pro264Ser haplotype. The G297A change was not seen in the controls. 
5 A number of novel polymorphisms in AGT were detected in the second set of unrelated 

diabetic individuals. Segregation analyses showed that about 50% of the polymorphisms 
segregated into all affected individuals within a family (Table 2). An individual in family 54 
contained a G to T transversion at nucleotide 26 in alternative exon I. Within the genomic 
sequence upstream of exon I, the sequence variants A(-715)G, T(-363)A, G(-259)A, T(-253)C 

10 and G(-183)A were detected in individuals from families 266, 162, 339, 236 and 150, 
respectively. Three other polymorphisms in the upstream genomic sequence, T(-741)C, AG(- 
603) and C(-281)T, were linked in family 1 14. Although each of these changes lies upstream of 
the characterized proximal promoter in AGT (Brasier et al., 1999), they could potentially alter 
AGT expression by affecting promoter elements not yet identified. Recently, Yanai et al. (1999) 

1 5 have reported a role for hepatocyte nuclear factor 4 (HNF4) and chicken ovalabumin upstream 
pormoter transcription factor (COUP-TFII) in regulating transcription of the angiotensinogen 
gene. Truncating mutations in the HNF4 gene were found in patients with maturity onset 
diabetes of the youn, and the HNF4 gene is therefor also called MODY1 (Yamagata et al., 1996; 
Lidner et aL, 1997). One of the DNA fragments identified as an HNF4 activating element by 

20 deletion analysis is the fragment from -242 to -295. Three of the variants in the sequence 
upstream of exon 1 are within this fragment (T(-253)C in family 236, G(-259)A in family 339 
and C(-281)T in family 114). None of these variants was detected in 63 1 controls. 

Three previously unreported missense mutations were detected in the coding sequence of 
AGT. An Alal22Val missense mutation was detected in family 305, an Asp3 19Asn was present 

25 in family 318, and an Ala401Val was found in family 272. Of these, only Alal22Val segregated 
with the disease in the family. Considering the rare missense mutations which were detected in 
AGT, five are at positions highly conserved in primates (Cysl8Arg, Alal22Val, Pro264Ser, 
Asp319Asn, and Ala401Val, see Fig. 3b, 3c). In the 3' UTR a G to T transversion 35 bp 
downstream of the translation stop (G1530T) was observed in an individual from family 229. 

30 The discovery of enhancer elements in exon 5 and in the genomic sequence downstream of AGT 
(Nibu et al., 1994a; Nibu et al., 1994b) suggests this change might affect AGT expression. This 
change was detected once in 636 controls. Finally, two missense mutations that had been 
identified previously in hypertensive individuals were seen once in the set of type 1 diabetics. A 
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Tyr248Cys mutation was detected in an individual from family 348. This change was 
previously reported to affect the rate of angiotensinogen production in hypertensive individuals 
(Gimenez-Roqueplo et al., 1996). An individual in family 119 contained a Leu359Met 
substitution that had been shown previously in a hypertensive patient (Jeunemaitre et al., 1992). 
5 Both changes were also found in our control samples (Table 2). Neither of these missense 
changes segregated with diabetes in these families. 

All of the coding and nod-coding regions of AGT which were screened for 
polymorphisms in type 1 diabetics were also sequenced in 631 control individuals. Several 
polymorphisms were seen in the controls that were not seen in the cases (see Table 3). The 

10 frequency of rare changes, defined as present at less than 5 times in all samples, was compared 
in cases and controls. Fewer control samples carried rare changes when compared to type I 
diabetic cases (20 of 308 vs. 17 of 631, /?=0.007). It is interesting to note that there were no 
polymorphisms detected in the sequence upstream of exon 1 between -64 and -413 in the 631 
controls. There were five polymorphisms detected in this region in the type 1 diabetic cases 

15 including the three changes found in the fragment involved in activation of transcription of AGT 
by HNF4. 
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To conclude, the nature of the polymorphisms found in an initial set of diabetic families, 
their combined frequency in an independent set of cases compared with that in a set of controls 
(4/308 vs. 1/1003, z^O.008), and the abundance of rare mutations in type I diabetic cases all 
indicate that mutations in AGT increases susceptibility to the disease. The identification of 

5 missense mutations confirms the importance of the mature part of the protein in AGT structure 
and function. Novel polymorphisms in the alternative 5' exon could provide clues to tissue- 
specific functions of the renin-angiotensin system. The role of AGT in the microvascular 
complications of diabetes (Krolewski et al., 1998; Rogus et al., 1998) can be further considered. 
Perhaps the most interesting prospect is that our results in conjunction with the recent 

10 demonstration of angiotensin II receptors on cells of the islets of Langerhans that also stain for 
insulin, implies a direct role for AGT in pancreatic p-cell function (Tahmasebi et al., 1999). 
Understanding the contribution of AGT to type 1 diabetes could lead to new treatments and 
preventative medicines for the disease. 

15 While the invention has been disclosed in this patent application by reference to the 

details of preferred embodiments of the invention, it is to be understood that the disclosure is 
intended in an illustrative rather than in a limiting sense, as it is contemplated that modifications 
will readily occur to those skilled in the art, within the spirit of the invention and the scope of 
the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A nucleic acid selected from the group consisting of: 

(a) a DNA molecule. comprising the nucleotides set forth in SEQ ID NO:l, or 
complement thereof; 

(b) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:2, or 
complement thereof; 

(c) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:3, or 
complement thereof; 

(d) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:4, or 
complement thereof; 

(e) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:5, or 
complement thereof; and 

(f) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, or 
complement thereof. 

2. A nucleic acid selected from the group consisting of: 

(a) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, 
wherein G at nucleotide position 26 of the exon is substituted with T, or a complement 
thereof; 

(b) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, 
wherein C at nucleotide position 96 of the exon is substituted with T, or a complement 
thereof; 

(c) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, 
wherein T at nucleotide position 286 of the exon is substituted with C, or a complement 
thereof; 

(d) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein C at nucleotide position 2201 is substituted with T, or a complement thereof 

(e) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein T at nucleotide position 2271 is substituted with C, or a complement thereof 

(f) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein G at nucleotide position 603 is deleted, or a complement thereof 

(g) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein C at nucleotide position 273 1 is substituted with T, or a complement thereof 
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(h) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein A at nucleotide position 2297 is substituted with G, or a complement thereof 

(i) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein T at nucleotide position 2649 is substituted with A, or a complement thereof 

(j) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein G at nucleotide position 2753 is substituted with A, or a complement thereof 

(k) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein T at nucleotide position 2759 is substituted with C, or a complement thereof 

(1) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein G at nucleotide position 2829 is substituted with A, or a complement thereof 

(m) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein T at nucleotide position 190 is substituted with C, or a complement thereof; 

(n) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein C at nucleotide position 503 is substituted with T, or a complement thereof; 

(o) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein G at nucleotide position 927 is substituted with A, or a complement thereof; 

(p) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein C at nucleotide position 928 is substituted with T, or a complement thereof; 

(q) a DNA molecule comprising the nucleotides set forth in SEQ ID NO: 7, 
wherein G at nucleotide position 1093 is substituted with A, or a complement thereof; 

(r) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein C at nucleotide position 1340 is substituted with T, or a complement thereof; 
and 

(s) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein G at nucleotide position 1529 is substituted with T, or a complement thereof; 

A polypeptide selected from the group consisting of: 

(a) a protein molecule , comprising the amino acids set forth in SEQ ID NO:7, 
wherein Cys at position 5 1 is substituted with Arg; 

(b) a protein molecule comprising the amino acids set forth in SEQ ID NO: 7, 
wherein Ala at position 155 is substituted with Val; 

(c) a protein molecule comprising the amino acids set forth in SEQ ID NO: 7, 
wherein Pro at position 297 is substituted with Ser; 
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(d) a protein molecule comprising the amino acids set forth in SEQ ID NO:7, 
wherein Asp at position 352 is substituted with Asn; and 

(e) a protein molecule comprising the amino acids set forth in SEQ ID NO:7, 
wherein Ala at position 434 is substituted with Val. 

4. A method for detecting an alteration in AGT wherein said alteration is associated with 
IDDM in a human, wherein said method comprises analyzing an AGT gene or an AGT 
gene expression product from a tissue of said human. 

5. A method as claimed in claim 4 wherein the sequence of the AGT gene in said sample is 
compared with the sequence of one or more wild-type ^G7*gene sequences. 

6. A method for determining whether a human subject has or is at risk for developing 
diabetes mellitus comprising the step of: 

a) obtaining a sample from a subject, said sample comprising nucleic acid 
molecules containing AGT gene; and 

b) detecting the presence or absence of a genetic polymorphism in the gene of 
said subject, wherein the presence of said genetic polymorphism identifies a subject that 
has or is at risk for developing diabetes. 

7. A method for determining whether a human subject has or is at risk for developing 
diabetes mellitus comprising the step of: 

a) obtaining a sample from a subject, said sample comprising nucleic acid 
molecules containing AGT gene; and . 

b) detecting the presence or absence of a genetic polymorphism in the gene of 
said subject, wherein the presence of said genetic polymorphism identifies a subject that 
has or is at risk for developing diabetes, 

wherein said polymorphism is one of those specified in claim 2. 

8. A method for preventing or treating diabetes in a subject which comprises administering 
to the subject a therapeutically effective amount of a compound that agonizes or 
antagonizes wild-type AGT, or agonizes or antagonizes mutant AGT, or agonizes or 
antagonizes an AGT receptor. 
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The method of claim 8, wherein. said compound is selected from the group consisting of: 

(a) a drug; 

(b) an antisense molecule; 

(c) a ribozyme; 

(d) a triplex molecule; 

(e) wild-type AGT nucleic acid; 

(f) wild-type AGT protein; 

(g) a protein which binds wild-type or mutant AGT protein: 

(h) a peptidomimetic; 

(i) a non-peptide, non-nucleic acid small molecule; and . 
(j) an antibody. 

A method for preventing or treating diabetes in a subject which comprises administering 
to the subject a therapeutically effective amount of a compound that agonizes or 
antagonizes wild-type AGT fragment(s), or agonizes orantagonizes mutant AGT, or 
agonizes or antagonizes a receptor for a fragment or fragments of AGT. 

The method of claim 10, wherein said compound is selected from the group consisting 
of: 

(a) a drug; 

(b) an antisense molecule; 

(c) a ribozyme; 

(d) a triplex molecule; 

(e) wild -type A GT nucleic acid; 

(f) wild-type AGT protein; 

(g) a protein which binds wild-type or mutant AGT protein: 

(h) a peptidomimetic; 

(i) a non-peptide, non-nucleic acid small molecule; and . 
(j) an antibody. 

A method for preventing or treating diabetes in a subject which comprises administering 
to the subject a therapeutically effective amount of a compound that agonizes or 
antagonizes a protease that cleaves AGT or any of its fragments. 
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The method of claim 12, wherein said compound is selected from the group consisting 
of: 

(a) a drug; 

(b) an antisense molecule; 

(c) a ribozyme; 

(d) a triplex molecule; 

(e) wild-type A GT nucleic acid; 

(f) wild-type AGT protein; 

(g) a protein which binds wild-type or mutant AGT protein: 

(h) a peptidomimetic; 

(i) a non-peptide, non-nucleic acid small molecule; and . 
(j) an antibody. 

A non-human animal which carries an altered AGT allele in its genome, wherein said 
allele comprises one of the polymorphisms set forth in claim 2. 

A cell line isolated from the non-human animal of claim 14. 

A method of screening for drug candidates useful in treating diabetes resulting from a 
mutation in AGT, wherein said method involves mixing a mutant AGT in both the 
presence of a drug and the absence of said drug and measuring the level of the biological 
activity of the mutant AGT, wherein if the level of the biological activity is less in the 
presence of said drug than in the absence of said drug then said drug is a drug candidate 
for treating diabetes. 

A method of screening for drug candidates useful in treating diabetes resulting from a 
mutation in AGT, wherein said method comprises treating an animal which is 
heterozygous or homozygous for AGT containing said mutation with a drug wherein if 
said animal does not develop diabetes then said drug is a drug candidate for treating 
diabetes. 
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A method for screening potential diabetes therapeutics which comprises: combining an 
AGT binding partner and a compound suspected of being an diabetes therapeutic and 
measuring the biological activity of the binding partner. 
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<120> DIABETES GENE 

<130> Diaberes Gene III 

<140> 
<141> 

<150> US 60/174,700 
<151> 2000-01-06 

<150> US 60/135, 423 
<151> 1999-05-21 

• <160> 68 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 5308 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (3012) . . (3047) 
<223> exon 1 of AGT gene 

<400> 1 



gccccagcca 


cttgacccaa 


cctggagctc 


acaaaacccc 


gtgttgtata 


aaagtaaggt 


60 


ttaagggtct 


agggctgtgt 


caaagtggct 


ggggcaaagc 


tacaaattaa 


caacatctca 


120 


gcaaagcaat 


tgtttaaagt 


acagctcttt 


ttcaaaatgg 


agtctcttat 


gtcttccctt 


180 


tctatataga 


cacagtaaca 


gtctgatctc 


tctttttttt 


ccctacatat 


cccaatagca 


240 


cctgcctgac 


taatacatca 


tgctctgctg 


actccatatg 


tggctggttt 


cctggatccg 


300 


attctgactg 


atggtatctg 


ttctcacgct 


gctaataaag 


acataccaga 


gactgggtaa 


360 


tttataaaga 


aaaagaggtt 


taatggactt 


atgtgttcca 


cgtggctggg 


gaggcctcac 


420 


aatcatggtg 


gaaggtgaag 


gaggagcaaa 


agcacatctt 


acatggtggc 


tggcaaaaga 


480 


gagaaagcat 


gttcagggga 


actccccttt 


ataaaaccat 


cagatctcat 


gagacttatg 


540 


cactatcaca 


agaacagcac 


cagaaagatc 


caccctcatg 


attcaattac 


ctcccactgg 


600 


gtccctcccg 


tgacacatgg 


gaattatggg 


agctacaatt 


ggagatttgg 


gtggggacac 


660 


agccaaacca 


tatcagatgg 


cttatttggt 


ttctatgtag 


aacctctgct 


tttcattcaa 


720 


cagtcttcat 


ttagccacag 


ataagctctg 


tccctaactt 


ccgctgatgg 


aatgtacaca 


780 
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taagaaactt 


ccactgatgg 


aatgaacaca 


gaaggtgcct 


actgggaaga 


aaactggcct 


840 


gaatctgagc 


tgggtcaaat 


gtctgcagtc 


agtttgaatg 


gctgctcctt 


atgggaataa 


900 


tttacattct 


caataaaatt 


ctctagcaat 


tttctgattg 


attttaatga 


gctttaaagc 


960 


cttacgtaga 


agatccccca 


gctgatagtc 


agccttgggc 


atggattaag 


ggcttttaac 


1020 


caatcttgca 


acaagtttaa 


gcagatattc 


tttattgggt 


ccaatctaac 


caaaattatt 


1080 


ttcttatgtt 


ctccccagta 


acgtgtcatt 


attaagagaa 


gtttggcttg 


cttagaggcc 


1140 


aaatttagag 


ggtcctgaaa 


ttttattttc 


ttttacacca 


ctttccagca 


tgttacctga 


1200 


tcagttgttt 


attatctttg 


ctgttgaatg 


gagtgatcat 


tccaagggcc 


cgaggcagga 


1260 


ggcccaggca 


cagtggaaac 


tctcccaaag 


accaggatct 


ttgttttgtt 


ccctgacata 


1320 


tgctgagcac 


caggaatagt 


gaatgaatga 


aacaaattgt 


gaggctttaa 


agagccgaaa 


1380 


tatttaaaca 


ctgggcacaa 


ggttgttgct 


taatcagtgc 


tagatcctta 


cctccccctt 


1440 


gtgtccaggt 


cgacttgtta 


ctgcagttaa 


accacttgct 


gatcctcaaa 


caactagtta 


1500 


gtggcacagc 


caggcctagg 


accccagtct 


ctactgttcc 


aactaaccca 


ttcgcaggca 


1560 


ggagcacttt 


gaatggtctc 


ttattttaaa 


aaaattaaat 


taaaattgtc 


tatttattta 


1620 


gagacagagt 


cttactctgt 


agcccaggct 


cgagtgcagt 


ggtgcaatca 


tagctcactg 


1680 


taacctccat 


ctcctggcct 


caaaaagtgt 


ttgaattaca 


gatgcgaggc 


actgtacctg 


1740 


gcccgaatgt 


tctgttcaga 


caaagccacc 


tctaagtcgc 


tgtggggccc 


cagacaagtg 


1800 


atttttgagg 


agtccctatc 


tataggaaca 


aagtaattaa 


aaaaatgtat 


ttcagaattt 


1860 


acaggcccat 


gtgagatatg 


atttttttaa 


atgaagattt 


agagtaatgg 


gtaaaaaaga 


1920 


ggtatttgtg 


tgtttgttga 


ttgttcagtc 


agtgaatgta 


cagcttctgc 


ctcatatcca 


1980 


ggcaccatct 


cttcctgctc 


tttgttgtta 


aatgttccat 


tcctgggtaa 


tttcatgtct 


2040 


gccatcgtgg 


atatgccgtg 


gctccttgaa 


cctgcttgtg 


ttgaagcagg 


atcttccttc 


2100 


ctgtcccttc 


agtgccctaa 


taccatgtat 


ttaaggctgg 


acacatcacc 


actcccaacc 


2160 


tgcctcaccc 


actgcgtcac 


ttgtgatcac 


tggcttctgg 


cgactctcac 


caaggtctct 


2220 


gtcatgccct 


gttataacga 


ctacaaaagc 


aagtcttacc 


tataggaaaa 


taagaattat 


2280 


aaccctttta 


ctggtcatgt 


gaaacttacc 


atttgcaatt 


tgtacagcat 


aaacacagaa 


2340 


cagcacatct 


ttcaatgcct 


gcatcctgaa 


ggcattttgt 


ttgtgtcttt 


caatctggct 


2400 


gtgctattgt 


tggtgtttaa 


cagtctcccc 


agctacactg 


gaaacttcca 


gaaggcactt 


2460 


ttcacttgct 


tgtgtgtttt 


ccccagtgtc 


tattagaggc 


ctttgcacag 


ggtaggctct 


2520 


ttggagcagc 


tgaaggtcac 


acatcccatg 


agcgggcagc 


agggtcagaa 


gtggcccccg 


2580 
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tgttgcctaa gcaagactct cccctgccct 
ctgtggcctc trgggggtac atctcccggg 
caggctgtca cacacctagg gagatgctcc 
caaacttcgg taaatgtgta actcgaccct 
tctgcatcga tcactaagac ttcctggaag 
ctgtccttct ggccagcctg tggtctggcc 
cacaggcagc ctgggaacag ctccatcccc 
ccggccaggg gaagaagctg ccgttgttct 
cccctcagct ccttctcggc cttgtctctc 
aaaaggcccg ggaggaggca cggtgatgac 
tcatccggac gcgggctttc ccccactcgg 
ctcctcctca gcaataggac aagatggtca 
gtcctcagga acggagcaga ctggtacagg 
ggtcctcttg ccatcaggag gaatagcagg 
ggccgggctc ggtcagtctc cagctgggat 
agacagactg cctgactgtg tgtcatcagg 
ctcacctata aaatggaact aataacgatg 
tagctgaaga gaactaatat aacatgaaag 
gcagattcca attttattcg agggcagcgt 
gtcagggggc ctggttctga gtcctagttc 
agccactgga cctctctgtt cctctgcaaa 
ccaggatgca gtaggtccca gggactatcc 
cagcgggaac cctgttccgg ggctcacatt 
acagaataag tgcataacta catggtacag 
atgagagagt tgagagtgcc cggtgtggtg 
gggaaacctc agggaaggtg aggactatgg 
gccctgcaaa tggcaggcac cacaggagca 
cccgggagaa gtagggagac atagaagggg 
aggagaaggc ggaggggcac tcagggcgag 
tactcccagg acgcggctgc gtgcagacct 



ctgccctctg 


cacctccggc 


ctgcatgtcc 


2640 


gctgggt cag 


aaggcctggg 


tggttggcct 


2700 


cgtttctggg 


aaccttggcc 


ccgactcctg 


2760 


gcaccggctc 


actctgttca 


gcagtgaaac 


2820 


aggtcccagc 


gtgagtgtcg 


cttctggcat 


2880 


aagtgatgta 


accctcctct 


ccagcctgtg 


2940 


acccctcagc 


tataaatagg 


gcatcgtgac 


3000 


gggtactaca 


gcagaaggta 


agccgggggc 


3060 


tcagatgtaa 


ctgagctgtg 


ggctaggagg 


3120 


tgaaaaacct 


ctcccctctc 


ataagaccag 


3180 


tgcccacctg 


gggtcttaca 


ggaggagctg 


3240 


ggtcttcctg 


ttccgctgag 


aaaagttagg 


3300 


aacagagtca 


tcatggccaa 


gagtccaccg 


3360 


gcttgtgcag 


gaattggggc 


tggagggaag 


3420 


ccccagagtg 


gtcaccctac 


ccctccctcg 


3480 


ctggtcaccg 


tctccctgaa 


cctcgatttg 


3540 


cctgggctcc 


ctgtctcagg 


ggctctggta 


3600 


tgctttctaa 


gctttgggat 


aagctaaaag 


3660 


agattggtgc 


ttcagctcgt 


ggatgacaga 


3720 


tgtctcttcc 


cagctgtgtg 


acgttgaaca 


3780 


acagcatgaa 


ccaattcatt 


aactacttct 


3840 


t'aggaatgtg 


ggctgtatta 


gtaaacacaa 


3900 


cacatcagag 


caaacagaca 


aagacgctgg 


3960 


agggttataa 


ggagggaaaa 


ggggagctgg 


4020 


/t ri a a /t rt 

ggyeJdayi— i_y 


(~> _a r-r r~r (~r +- /—J O a a 


i_ o o i_ y o q i_ <w a 


4080 


tgaggtcaga 


ggggttgata 


tgagaacagt 


4140 


tgagccgtca 


tcttcacctt 


tagcattcag 


4200 


caggtgctgg 


ccaagaggca 


ggggcaggag 


4260 


ggtgtcaggc 


ccgccacccc 


agagcaccat 


4320 


ggaaccagcc 


tagggagcag 


ccgcagatca 


4380 
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caactgagaa 


caaacgacag 


tctctgcctc 


aaaaatggcc 


catggaattg 


cgtctctgga 


4440 


gacgctgcct 


gagcaggagc 


agcacagtga 


gcgggctgca 


tcgaccagcg 


ccatccaaac 


4500 


cccgaacagt 


tggcgcttgt 


caggcaggac 


ttcccagcag 


tcggttccca 


caggtttccc 


4560 


ctgttgacct 


gatttgatgt 


gactgtctag 


attaggtgtg 


aactggtggc 


ttaggcttct 


4620 


ctgcacagaa 


aggcctgcaa 


gcagcagaga 


gagttttctg 


ttccattttt 


ccatgtcatg 


4680 


tggctcttcc 


tgagaacagc 


ggatggagtc 


aaatgcatgg 


ggagtggggt 


gagatggtag 


4740 


ctgaggtcag 


aatttggcat 


ttgaatgact 


gaagcagaac 


aaaacacacc 


aggtacttca 


4800 


gcagctgcac 


cgtgttgagg 


gcaggtgctg 


gttacgggtc 


tgggtgaggg 


aagccagctg 


4860 


ccaatgtaag 


aagaatgact 


gggtatgctt 


agatgaagca 


gaaaaatcta 


ggcatcaagg 


4920 


tggccttgag 


tcagtgatga 


cacgctacag 


ctccaaggaa 


gcctggccta 


gccctggggg 


4980 


gacagaaaag 


gccaagaagt 


gacgatattg 


cagtacaccc 


ccctccacaa 


gaaatgagtg 


5040 


agatgtggta 


caaaatgtta 


gaattgaatg 


aatcaataga 


ataaacgttc 


atcccttcaa 


5100 


tcaagaagag 


tcagatgaaa 


tgaattagca 


gggccagccc 


aagaacctct 


tctgggggtc 


5160 


tcagggtagc 


tttcatttgt 


agcagctgag 


gcggaaccag 


ctgtaaggcc 


tttgagagaa 


5220 


cgtggtgctg 


gacccgtgtc 


tagggcaggg 


gttctaaacc 


ctgcttacat 


atcagagtca 


5280 


cctgagaatt 


ttctattttt 


tttttttt 








5308 



<210> 2 
<211> 1159 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (151) . . (1009) 
<223> exon 2 of AGT gene 

<400> 2 

aatccatttc ttcttttttt tgtagacatg aatggattta ttctgggcta aatggtgaca 60 

gggaatattg agacaatgaa agatctggtt agatggcact taaaggtcag ttaataacca 120 

cctttcaccc tttgcaaaat gatatttcag ggtatgcgga agcgagcacc ccagtctgag 180 

atggctcctg ccggtgtgag cctgagggcc accatcctct gcctcctggc ctgggctggc 240 

ctggctgcag gtgaccgggt gtacatacac cccttccacc tcgtcatcca caatgagagt 300 

acctgtgagc agctggcaaa ggccaatgcc gggaagccca aagaccccac cttcatacct 360 

gctccaattc aggccaagac atcccctgtg gatgaaaagg ccctacagga ccagctggtg 420 

ctagtcgctg caaaacttga caccgaagac aagttgaagg ccgcaatggt cgggatgctg 480 



gccaacttct tgggcttccg tatatatggc 
ggggccaccg tcctctcccc aacggctgtc 
gccttggacc acacagctga caggctacag 
aactgcacct cccggctgga tgcgcacaag 
ctgctagtgg cccagggcag ggctgatagc 
ggcgtgttca cagccccagg cctgcacctg 
tatacccctg tggtcctccc acgctctctg 
aagattgaca ggttcatgca ggctgtgaca 
gccagtgtgg acagcaccct ggctttcaac 
ctctctgctg gctctggccc taggacttag 
aggccttgga gatgggcagg gggcagccct 
gtggggaagt ggcctctcc 

<210> 3 

<211> 568 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (151) . . (418) 
<223> exon 3 of AGT gene 

<400> 3 

ccaggagaat gcagagtggc ctgggggggg 
gcaggatctg ggcctttggt gtctgccagc 
ttccctccct gtctcctgtc tggatttcag 
agccccagga gttctgggtg gacaacagca 
tgggcacctt ccagcactgg agtgacatcc 
tcactgagag cgcctgcctg ct.gctgat.cc 
tggagggtct cactttccag caaaactccc 
aggagcctcc cggtctcccc tggaatgtgg 
ggggtgggga gtagacacac ctgagctgag 
tggcacgggg ctgggcaggc ggcctgtg 

<210> 4 
<211> 445 



5 

atgcacagtg agctatgggg cgrggtccat 540 
tttggcaccc tggcctctct ctatctggga 600 
gcaatcctgg gtgttccttg gaaggacaag 660 
gtcctgtctg ccctgcaggc tgtacagggc 720 
caggcccagc tgctgctgtc cacggtggtg 780 
aagcagccgt ttgtgcaggg cctggctctc 840 
gacttcacag aactggatgt tgctgctgag 900 
ggatggaaga ctggctgctc cctgacggga 960 
acctacgtcc acttccaagg taaggcaaac 1020 
tatccaatgt gtagctgaga tcagccagtc 1080 
gcggacatac ctggtgacca cccttgagaa 1140 

1159 



ccaaggaagg aggctgaggc agggcgaggg 60 
cctcattcct gcccctgtct tgggtgactc 120 
ggaagatgaa gggcttctcc ctgctggccg 180 
cctcagtgtc tgttcccatg ctctctggca 240 
aggacaactt ctcggtgact caagtgccct 300 
ag.cct.cac.ta tgcctctgac ctggacaagg 360 
tcaactggat gaagaaactg tctccccggt 420 
gagccacact gtcctgccca ggctgggggc 480 
ccttgggtgc agagcagggc aggg:ccgcgg 540 

568 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (151) . . (295) 
<223> exon 4 of AGT gene 

<400> 4 

cccaaggggg aggatgactt agttgggtga tggggggagc aaacatccct gtcgtcaggg 60 
ttgggtgcaa ggagcataag cctgcctggc ctctgggaga gccctcactg tgtggcctgg 120 
agccttccta actgtgcatc atctccccag gaccatccac ctgaccatgc ccca.actggt 180 
gctgcaagga tcttatgacc tgcaggacct gctcgcccag gctgagctgc ccgccattct 240. 
gcacaccgag ctgaacctgc aaaaattgag caatgaccgc atcagggtgg gggaggtatg 300 
tgtgagcctg tgtctgtgcc tgacctgggt tccaagtgtg cacagggtgg gaggcatgga 360 
tgtaagggac acagaggagg ctatgggtgg ggccagcagg gcaagaggga gcggagagta 420 
gggccaaagg tgggagagaa gtagc 4 45 

<210> 5 

<211> 1091 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (151) . . (941) 
<223> exon 5 of AGT gene 

<400> 5 

actgtgcatc ctcctggcag gctggaaaat gtccccctcc aggacagtgc acagcacaga 60 

ggtcctgagc ccaccccggc cctctagccc tcagcaccct gggtcaccca gtgcgccctc 120 

agaatgatcc tgatgtctgc tgctttgcag gtgctgaaca gcattttttt tgagcttgaa 180 

gcggatgaga gagagcccac agagtctacc caacagctta acaagcctga ggtcttggag 240 

gtgaccctga accgcccatt cctgtttgct gtgtatgatc aaagcgccac tgccctgcac 300 

ttcctgggcc gcgtggccaa cccgctgagc acagcatgag gccagggccc cagaacacag 360 

tgcctggcaa ggcctctgcc cctggccttt gaggcaaagg ccagcagcag ataacaaccc 420 

cggacaaatc agcgatgtgt cacccccagt ctcccacctt ttcttctaat gagtcgactt 480 

tgagctggaa agcagccgtt tctccttggt ctaagtgtgc tgcatggagt gagcagtaga 540 

agcctgcagc ggcacaaatg cacctcccag tttgctgggt ttattttaga gaatgggggt 600 

ggggaggcaa gaaccagtgt ttagcgcggg actactgttc caaaaagaat tccaaccgac 660 

cagcttgttt gtgaaacaaa aaagtgttcc cttttcaagt tgagaacaaa aattgggttt 720 



7 



taaaattaaa gtatacattt ttgcattgcc ttcggtttgt atttagtgtc ttgaatgtaa 780 
gaacatgacc tccgtgtagt gtctgtaata ccttagtttt ttccacagat gcttgtgatt 840 
tttgaacaat acgtgaaaga tgcaagcacc tgaatttctg tttgaatgcg gaaccatagc 900 
tggttatttc tcccttgtgt tagtaataaa cgtcttgcca caataagcct ccaaaaattt 960 
tatctttcat ttagcagcca aacagatgta tacaattcag cagatagact gtgcaaacga 1020 
aagtgctttc ctggactttg gatggaattt ccatgggagg tctgagccag tacttagcag 1080 
tcctttgaag t 1091 

<210> 6 
<211> 2099 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (40) . . (1494) 

<220> 

<221> sig — peptide 
<222> (40) . . (138) 

<400> 6 

aagaagctgc cgttgttctg ggtactacag cagaagggt atg egg aag cga gca 54 

Met Arg Lys Arg Ala 
1 5 

ccc cag tct gag atg get cct gcc ggt gtg age ctg agg gcc acc ate 102 
Pro Gin Ser Glu Met Ala Pro Ala Gly Val Ser Leu Arg Ala Thr lie 
10 15 20 

etc tgc etc ctg gcc tgg get ggc ctg get gca ggt gac egg gtg tac 150 
Leu Cys Leu Leu Ala Trp Ala Gly Leu Ala Ala Gly Asp Arg Val Tyr 
25 30 35 

ata cac ccc ttc cac etc gtc ate cac aat gag agt acc tgt gag cag 198 
lie His Pro Phe His Leu Val lie His Asn Glu Ser Thr Cys Glu Gin 
40 45 50 

ctg gca aag gcc aat gcc ggg aag ccc aaa gac ccc acc ttc ata cct 246 
Leu Ala Lys Ala Asn Ala Gly Lys Pro Lys Asp Pro Thr Phe lie Pro 
55 60 65 

get cca att cag gcc aag aca tec cct gtg gat gaa aag gcc eta cag 294 
Ala Pro lie Gin Ala Lys Thr Ser Pro Val Asp Glu Lys Ala Leu Gin 
70 75 80 85 

gac cag ctg gtg eta gtc get gca aaa ctt gac acc gaa gac aag ttg 342 
Asp Gin Leu Val Leu Val Ala Ala Lys Leu Asp Thr Glu Asp Lys Leu 
90 95 100 

aag gcc gca atg gtc ggg atg ctg gcc aac ttc ttg ggc ttc cgt ata 390 
Lys Ala Ala Met Val Gly Met Leu Ala Asn Phe Leu Gly Phe Arg lie 
105 110 115 
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tat ggc atg cac agt gag eta tgg ggc gtg gtc cat ggg gec acc gtc 438 

Tyr Gly Met His Ser Glu Leu Trp Gly Val Val His Gly Ala Thr Val 

120 125 130 

etc tec cca acg get gtc ttt ggc acc ctg gec tct etc tat ctg gga 486 

Leu Ser Pro Thr Ala Val Phe Gly Thr Leu Ala Ser Leu Tyr Leu Gly 

135 140 145 

gee ttg gac cac aca get gac agg eta cag gca ate ctg ggt gtt cct 534 

Ala Leu Asp His Thr Ala Asp Arg Leu Gin Ala He Leu Gly Val Pro 

150 155 160 165 

tgg aag gac aag aac tgc acc tec egg ctg gat gcg cac aag gtc ctg 582 

Trp Lys Asp Lys Asn Cys Thr Ser Arg Leu Asp Ala His Lys Val Leu 

170 175 180 

tct gec ctg cag get gta cag ggc ctg eta gtg gec cag ggc agg get 630 

Ser Ala Leu Gin Ala Val Gin Gly Leu Leu Val Ala Gin Gly Arg Ala 

185 190 195 

gat age cag gec cag ctg ctg ctg tec acg gtg gtg ggc gtg ttc aca 678 

Asp Ser Gin Ala Gin Leu Leu Leu Ser Thr Val Val Gly Val Phe Thr 

200 205 210 

gee cca ggc ctg cac ctg aag cag ccg ttt gtg cag ggc ctg get etc 726 

Ala Pro Gly Leu His Leu Lys Gin Pro Phe Val Gin Gly Leu Ala Leu 

215 220 225 

tat acc cct gtg gtc etc cca cgc tct ctg gac ttc aca gaa ctg gat 774 

Tyr Thr Pro Val Val Leu Pro Arg Ser Leu Asp Phe Thr Glu Leu Asp 

230 235 240 245 

gtt get get gag aag att gac agg ttc atg cag get gtg aca gga tgg 822 

Val Ala Ala Glu Lys He Asp Arg Phe Met Gin Ala Val Thr Gly Trp 

250 255 260 

aag act ggc tgc tec ctg acg gga gee agt gtg gac age acc ctg get 870 

Lys Thr Gly Cys Ser Leu Thr Gly Ala Ser Val Asp Ser Thr Leu Ala 

265 270 275 

ttc aac acc tac gtc cac ttc caa ggg aag atg aag ggc ttc tec ctg 918 

Phe Asn Thr Tyr Val His Phe Gin Gly Lys Met Lys Gly Phe Ser Leu 

280 285 290 

ctg gec gag ccc cag gag ttc tgg gtg gac aac age acc tea gtg tct 966 

Leu Ala Glu Pro Gin Glu Phe Trp Val Asp Asn Ser Thr Ser Val Ser 

295 300 305 

gtt ccc atg etc tct ggc atg ggc acc ttc cag cac tgg agt gac ate 1014 

Val Pro Met Leu Ser Gly Met Gly Thr Phe Gin His Trp Ser Asp He 

310 315 320 325 

cag gac aac ttc teg gtg act caa gtg ccc ttc act gag age gec tgc 1062 

Gin Asp Asn Phe Ser Val Thr Gin Val Pro Phe Thr Glu Ser Ala Cys 

330 335 340 

ctg ctg ctg ate cag cct cac tat gee tct gac ctg gac aag gtg gag 1110 

Leu Leu Leu He Gin Pro His Tyr Ala Ser Asp Leu Asp Lys Val Glu 

345 350 355 



»1 U UU/ M /Di 



re A/couu/ioo// 
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ggt etc act ttc cag caa aac tec etc aac tgg atg aag aaa erg tct 1158 
Gly Leu Thr Phe Gin Gin Asn Ser Leu Asn Trp Met Lys Lys Leu Ser 
360 365 370 

ccc egg acc ate cac ctg ace atg ccc caa ctg gtg ctg caa gga tct 1206 
Pro Arg Thr lie His Leu Thr Met Pro Gin Leu Val Leu Gin Gly Ser 
375 380 385 



tat gac ctg cag gac ctg etc gec cag get gag ctg ccc gec att ctg 
Tyr Asp Leu Gin Asp Leu Leu Ala Gin Ala Glu Leu Pro Ala lie Leu 
390 395 400 405 



1254 



cac acc gag ctg aac ctg caa aaa ttg age aat gac cgc ate agg gtg 1302 
His Thr Glu Leu Asn Leu Gin Lys Leu Ser Asn Asp Arg lie Arg Val 
410 415 420 

ggg gag gtg ctg aac age att ttt ttt gag ctt gaa gcg gat gag aga 1350 
Gly Glu Val Leu Asn Ser lie Phe Phe Glu Leu Glu Ala»Asp Glu Arg 
425 430 435 



gag ccc aca gag tct acc caa cag ctt aac aag cct gag gtc ttg gag 
Glu Pro Thr Glu Ser Thr Gin Gin Leu Asn Lys Pro Glu Val Leu Glu 
440 445 450 



1398 



gtg acc ctg aac cgc cca ttc ctg ttt get gtg tat gat caa age gee 1446 
Val Thr Leu Asn Arg Pro Phe Leu Phe Ala Val Tyr Asp Gin Ser Ala 
455 460 465 

act gee ctg cac ttc ctg ggc cgc gtg gee aac ccg ctg age aca gca 1494 
Thr Ala Leu His Phe Leu Gly Arg Val Ala Asn Pro Leu Ser Thr Ala 
470 475 480 485 



tgaggecagg 


gccccagaac 


acagtgcctg 


gcaaggcctc 


tgcccctggc 


ctttgaggca 


1554 


aaggecagea 


gcagataaca 


accccggaca 


aatcagegat 


gtgtcacccc 


cagtctccca 


1614 


ccttttcttc 


taatgagtcg 


actttgagct 


ggaaagcagc 


cgtttctcct 


tggtctaagt 


1674 


gtgctgcatg 


gagtgagcag 


tagaagectg 


cagcggcaca 


aatgcacctc 


ecagtttget 


1734 


gggtttattt 


tagagaatgg 


gggtggggag 


gcaagaacca 


gtgtttagcg 


egggactact 


1794 


gttccaaaaa 


gaattccaac 


cgaccagctt 


gtttgtgaaa 


caaaaaagtg 


ttcccttttc 


1854 


aagttgagaa 


caaaaattgg 


gttttaaaat 


taaagtatac 


atttttgeat 


tgccttcggt 


1914 


ttgtatttag 


tgtcttgaat 


gtaagaacat 


gacctccgtg 


tagtgtctgt 


aataccttag 


1974 


ttttttccac 


agatgettgt 


gatttttgaa 


caatacgtga 


aagatgcaag 


cacctgaatt 


2034 


tctgtttgaa 


tgeggaacca 


tagctggtta 


tttctccctt 


gtgttagtaa 


taaaegtett 


2094 


gecac 












2099 



<210> 7 
<211> 485 
<212> PRT 

<213> Homo sapiens 
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<400> 7 

Met Arg Lys Arg Ala Pro Gin Ser Glu Met Ala Pro Ala Gly Val Ser 
1 5 10 15 

Leu Arg Ala Thr lie Leu Cys Leu Leu Ala Trp Ala Gly Leu Ala Ala 
20 25 30 

Gly Asp Arg Val Tyr lie His Pro Phe His Leu Val lie His Asn Glu 
35 40 45 

Ser Thr Cys Glu Gin Leu Ala Lys Ala Asn Ala Gly Lys Pro Lys Asp 
50 55 60 

Pro Thr Phe lie Pro Ala Pro lie Gin Ala Lys Thr Ser Pro Val Asp 
65 70 75 80 

Glu Lys Ala Leu Gin Asp Gin Leu Val Leu Val Ala Ala Lys Leu Asp 
85 90 95 

Thr Glu Asp Lys Leu Lys Ala Ala Met Val Gly Met Leu Ala Asn Phe 
100 105 110 

Leu Gly Phe Arg lie Tyr Gly Met His Ser Glu Leu Trp Gly Val Val 
115 120 125 

His Gly Ala Thr Val Leu Ser Pro Thr Ala Val Phe Gly Thr Leu Ala 
130 135 140 

Ser Leu Tyr. Leu Gly Ala Leu Asp His Thr Ala Asp Arg Leu Gin Ala 
145 150 155 160 

lie Leu Gly Val Pro Trp Lys Asp Lys Asn Cys Thr Ser Arg Leu Asp 
165 . 170 175 

Ala His Lys Val Leu Ser Ala Leu Gin Ala Val Gin Gly Leu Leu Val 
180 185 190 

Ala Gin Gly Arg Ala Asp Ser Gin Ala Gin Leu Leu Leu Ser Thr Val 
195 200 205 

Val Gly Val Phe Thr Ala Pro Gly Leu His Leu Lys Gin Pro Phe Val 
210 215 220 

Gin Gly Leu Ala Leu Tyr Thr Pro Val Val Leu Pro Arg Ser Leu Asp 
225 230 235 240 

Phe Thr Glu Leu Asp Val Ala Ala Glu Lys lie Asp Arg Phe Met Gin 
245 250 255 

Ala Val Thr Gly Trp Lys Thr Gly Cys Ser Leu Thr Gly Ala Ser Val 
260 265 270 

Asp Ser Thr Leu Ala Phe Asn Thr Tyr Val His Phe Gin Gly Lys Met 
275 280 285 

Lys Gly Phe Ser Leu Leu Ala Glu Pro Gin Glu Phe Trp Val Asp Asn 
290 295 300 

Ser Thr Ser Val Ser Val Pro Met Leu Ser Gly Met Gly Thr Phe Gin 
305 310 315 320 
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His Trp Ser Asp lie Gin Asp Asn Phe Ser Val Thr Gin Val Pro Phe 
325 330 335 

Thr Glu Ser Ala Cys Leu Leu Leu lie Gin Pro His Tyr Ala Ser Asp 
340 345 350 

Leu Asp Lys Val Glu Gly Leu Thr Phe Gin Gin Asn Ser Leu Asn Trp 
355 360 365 

Met Lys Lys Leu Ser Pro Arg Thr lie His Leu Thr Met Pro Gin Leu 
370 375 380 

Val Leu Gin Gly Ser Tyr Asp Leu Gin Asp Leu Leu Ala Gin Ala Glu 
385 390 395 400 

Leu Pro Ala lie Leu His Thr Glu Leu Asn Leu Gin Lys Leu Ser Asn 
405 410 415 

Asp Arg lie Arg Val Gly Glu Val Leu Asn Ser lie Phe Phe Glu Leu 
420 • 425 430 

Glu Ala Asp Glu Arg Glu Pro Thr Glu Ser Thr Gin Gin Leu Asn Lys 
435 440 445 

Pro Glu Val Leu Glu Val Thr Leu Asn Arg Pro Phe Leu Phe Ala Val 
450 455 460 

Tyr Asp Gin Ser Ala Thr Ala Leu His Phe Leu Gly Arg Val Ala Asn 
465 470 475 480 

Pro Leu Ser Thr Ala 
485 



<210> 8 
<211> 681 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (151) . . (531) 

<223> altenate exon 1 of AGT gene 
<400> 8 

agggcgggga aggggggagg aagggggact gagactctcc tgtttctata tgctatgatg 60 
ggatttgacc tgcaattgaa cctttcctca aacacaaggc acaaacctgc cccagcccat 120 
catcctccca cagaggcact gtgaatgtat aaagaaaggc gatgctaaga aacgtgctgc 180 
aggggaggct ggcacattct gctccttagc taaccgcaca ggcccatggg tccacacaca 240 
tcagaccatg gctctctgtg gtctatcttt gccattactt taaccaggct gtttctcatg 300 
cgctcatgga gggtgactga agatgccacc ccgaggtgca ccctcagcct cgctggacac 360 
ttacggctcg cattctccag ccggaccagg cagttgagga agtcatcgaa gtccagctgg 420 
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agctcctcat ccgcatacct gagcacaatc agctgcagga ggtggctgct cagctgaaag 480 
cctgtgggga caaaggagga gacccaggac tcgctgtatc cgagccccga ggtgagtctg 540 
cacctttgag ggagctccca ggcagctgag gagagggctc agaagaggaa gccactgttt 600 
cactcctcaa agctggggtc cccagaccca gtgggactgc tgcccttggt gtgaatgcag 660 
cgtgagagtg tgaatctgtg g 681 

<210> 9 
<211> 18 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :M13 forward 
primer 

<400> 9 

gttttcccag tcacgacg 18 

<210> 10 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :M13 reverse 
primer 

<400> 10 

aggaaacagc tatgaccat 19 

<210> 11 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 11 

cctcgaagtg gcctatggaa 20 



<210> 12 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 12 

gcgagactct atgggtgatg 



<210> 13 
<211> 20 
<212> DNA 
<213> Homo 



sapiens 



V* UU/ i A t Da 



<400> 13 

gactgagact ctcctgtttc 



<210> 14 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 14 

cggggtggca tcttcagtc 



<210> 15 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 15 

tggcacattc tgctccttag 



<210> 16 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 16 

atacagcgag tcctgggtct 



<210> 17 
<211> 20 
<212> DNA 
<213> Homo 

<400> 17 
gaccaggcag 

<210> 18 
<211> 20 
<212> DNA 
<213> Homo 

<400> 18 
gattcacact 



sapiens 
ttgaggaagt 



sapiens 
ctcacgctgc 



<210> 19 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 19 

ccactttcca gcatgttacc 



<210> 20 
<211> 20 
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<212> DNA 

<213> Homo sapiens 
<400> 20 

ctgctattcc tcctgatggc 



<210> 21 
<211> 20 
<212> DNA 
<213> Homo 

<400> 21 
acaggcccat 



sapiens 
gtgagatatg 



<210> 22 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 22 

tggtattagg gcactgaagg 



<210> 23 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 23 

tgtgttgaag caggatcttc 



<210> 24 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 24 

gatgcaggca ttgaaagatg 



<210> 25 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 25 

gtgtttaaca gtctccccag c 



20 



20 



20 



21 



<210> 26 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 26 

ctaggtgtgt gacagcctga 



20 



<210> 27 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 27 

catgtccctg tggcctctt 



<210> 28 
<211> 18 
<212> DNA 
<213> Homo 

<400> 28 
tgaggggtgg 



sapiens 
ggatggag 



<210> 29 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 29 

ctggtcatgt gaaacttacc 



<210> 30 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 30 

accttcagct gctccaaaga 



<210> 31 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 31 

ccactttcca gcatgttacc 



<210> 32 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 32 

ctgctattcc tcctgatggc 



<210> 33 
<211> 18 
<212> DNA 
<213> Homo 



sapiens 
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<400> 33 

aaccctcctc tccagcct 18 



<210> 34 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 34 

tcatcaccgt gcctcctc 18 



<210> 35 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 35 

tacttggact ttgggctgag 20 



<210> 36 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 36 

ttgagcaagg cactttgttt c 



<210> 37 
<211> 21 
<212> DNA 
<213> Homo 



sapiens 



<400> 37 

ggctaaatgg tgacagggaa 



<210> 38 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 38 

tagggccttt tcatccacag 



<210> 39 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 39 

aatgccggga agcccaaaga 



<210> 40 
<211> 20 



17 



<212> DNA 

<213> Homo sapiens 



<400> 40 

tccaaggctc ccagatagag 



20 



<210> 41 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 41 

ctctccccaa cggctgtctt 



20 



<210> 42 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 42 

ctgcacaaac ggctgcttca 



20 



<210> 43 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 43 

ctctatctgg gagccttgga 



20 



<210> 44 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 44 

cacagcctgc atgaacctgt 



20 



<210> 45 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 45 

tgaagcagcc gtttgtgcag 



20 



<210> 46 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 46 

tccccacttc tcaagggtg 



19 
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<210> 47 
<211> 20 
<212> DNA 
<213> Homo 

<400> 47 
acaggttcat 



sapiens 
gcaggctgtg 



<210> 48 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 48 

tccccacttc tcaagggtg 



<210> 49 
<211> "21 
<212> DNA 

<213> Homo sapiens 
<400> 49 

tgcctcagtg aactcaaatg g 



<210> 50 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 50 

ctccccctcc aaaactacca 



<210> 51 
<211> 19 
<212> DNA 
<213> Homo 

<400> 51 
caggagaatg 



sapiens 
cagagtggc 



<210> 52 
<211> 19 
<212> DNA 
<213> Homo 

<400> 52 
gcaggcgctc 

<210> 53 
<211> 20 
<212> DNA 
<213> Homo 



sapiens 
tcagtgaag 

sapiens 
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<400> 53 

agcactggag tgacatccag 



20 



<210> 54 
<211> 20, 
<212> DNA 

<213> Homo sapiens 
<400> 54 

gatggaggac tggtagacag 



<210> 55 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 55 

aagggggagg atgacttagt 



<210> 56 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 56 

gaaattccat ccaaagtcca g 



<210> 57 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 57 

gtgcaaggag cataagcctg 



<210> 58 
<211> 19 
<212> DNA 
<213> Homo 

<400> 58 
ctactctccg 



sapiens 
ctccctctt 



<210> 59 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 59 

aagggggagg atgacttagt 



<210> 60 
<211> 21 
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<212> DNA 

<213> Homo sapiens 
<400> 60 

gaaattccat ccaaagtcca g 



<210> 61 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 61 

tagccctcag caccctg 



<210> 62 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 62 

tgtccggggt tgttatctg 



<210> 63 
<211> 20 
<212> DNA 
<213> Homo 

<400> 63 
agaacacagt 



sapiens 
gcctggcaag 



<210> 64 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 64 

ttcacaaaca agctggtcgg 



<210> 65 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 65 

agtgtttagc gcgggactac 



<210> 66 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 66 

tttggaggct tattgtggca 



21 



<210> 67 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 67 

tgaaagatgc aagcacctga 



<210> 68 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 68 

gaaagcactt tcgtttgcac 



20 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/USOO/13327 



A. CLASSIFICATION OF SUBJECT MATTER 
1PC(7) :Plea*e See Extra Sheet 
US CL Please See Extra Sheet 
According to International Patent Classification (IPC) or to both national classification and IPC 



FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/6. 69.1. 91.1. 91.31. 455. 375; 514/2, 4, 44; 536/23.1, 23,4. 23.5, 23.51. 243. 24.5 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and. where practicable, search terms used) 
Dialog. WEST. Sequence searches: amino acid and nucleotide data bases (i.e. no size limits and oligo search). 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Y, A 



US 5,589,584 A (LALOUEL1 et aL) 31 December 1996, entire text, 
claims 1-3. 

O'BYRNE et al. Genetics of Hypertension. Therapeutic 
Implications. Drugs. August 1998, Vol. 56, No. 2, pages 203-214, 
especially pages 208-209. 

HIXSON et al. Detection and characterization of new mutations in 
the human angiotensinogen gene (AGT). Hum. Genet. 1995, Vol. 
96, pages 110-112, see entire document. 



1-7 



1-7 



1-7 



X Further documents are listed in the continuation of Box C. | | See patent family annex. 



Special categories of cited document!: 

document defining tha general state of the art which is not considered 
to be of particular relevance 

earlier document published on or after the international filing data 

document which may throw doubts on priority claim (s) or which is 
cited to establish the publication data of another citation or other 
special reason (as specified) 

document referring to an oral disclosure, use, exhibition or other 
document published prior to the international Tiling date but later than 



later document published after the international filing date or priorit y 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 



document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

document member of the same patent family 



Date of the actual completion of the international search 
10 AUGUST 2000 


Date of mailing of the international search report 

'02 OCT 2000 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington. D.C. 20231 
Facsimile No. (703) 305-3230 


'JANE zaka / * 

Telephone No. (703) 308-0 1 96 



Form PCT/1SA/210 (second sheet) (July 1998)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/USOO/13327 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Y.P 
X 
Y 
X 

X 
Y 



OHNO et al. Association analyses of the polymorphisms of 
angiotension-converting enzyme and angiotensionogen genes with 
diabetic nephropathy in Japanese non-insulin-dependent diabetics. 
Metabolism Clin, and Expl. 1996, Vol. 45, No. 2, pages 218-222, 
especially pages 218 and 219. 

WO 97/18312 Al (VIMRX HOLDINGS LTD.) 22 May 1997, 
pages 4-1 5 and especially page 19. 

US 5,916,907 A (BIRD) 29 June 1999, col. 1, lines 17-55. 
WO 97/28684 Al (ISHIKAWA et al.) 14 August 1997, abstract. 



WO 95/19568 Al (RATH, M.) 20 July 1995, abstract, page 76, 
claim 5. 

WO 98/58952 Al (PROTEUS MOLECULAR DESIGN LIMITED) 
30 December 1998, see entire document 



1-7 



8-11 and 16 
12,13 

8-11, 14-18 
8-11, 14-18 
8-11 and 16 

8-11, 16 
18 



Form PCT/1SA/210 (continuation of second sheet) (July 1998)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US00/13327 



A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (7): 

C12Q 1/68; C12P 21/06. 19/34; AO IN 43/04, 37/18; A61K 38/00 t 38/28; C07H 21/02, 21/04 

A. CLASSIFICATION OF SUBJECT MATTER: 
US CL : 

435/6, 69.1. 91.1. 91 31, 455. 375; 514/2. 4. 44; 536/23.1. 23.4, 23.5, 23.51. 243, 24.5 



Form PCT/1SA/210 (cxtni sheet) (July 1998)* 



WOOD/71751 



rCT/USOft/13327 



1/4 




in 



CD 



in 

CD 



SUBSTITUTE SHEET (RULE IS) 



2/4 






SUBSTITUTE SHEET (RULE 26) 



WO OB/71731 



PCT/USOG/13321 



3/4 





SUBSTITUTE SHEET (RULE 26) 



vvu uuv r J / ^i. 



4/4 



as 




Eft m n u 

n3 Q O W 
5ft CJ>i — I ^ ©■» g 

M 



5 

CD 



CD 
CO 



sis 



o 

CO 

d 



SUBSTITUTE SHEET (RULE 26) 



WO 00/7 J 751 



PCT/USrOD/13327 



I 

SCU'JttNCS LISTING 



<11 n> McGrail, Maura 

>1y3 i^d (Senetics, Inc. 

<i20> *j 1 abides cznz 

<130> 7>!filjetee G&ne III 

<140> 
<141> 

<1D0"> US 60/174 , 7IJ0 

<1M> u£ GD/13S r 423 
<151> 1999-05-55] 

<lhll> 68 

<170> PatcnbTn Vrta . 2.0 

<210> 1 
<211> 

<5!l3> Homo sapiens 
<220> 

<221> ml Ao_f mature! 
<?.25!> [3012) . - {3047] 
<223> &xon 1 of AGT -gene 

<4 00> 1 





cttgacccaa 




Aoaaaacccc 


gtgttgtata 


tiaagta arjcjt 


60 


tta^igg^t r-t 


& c]q 14c : 1 . 4 l.ftl: 


craaagtggct 


ggggcci«<j^i*. 


l./^a^a t.t.aa 


eras cat ctca 


120 


gcaaagcaat 


tg^ttaaagt 


acagcb£L t.fc. 


r.tcaaaatgg 


agtctcttat 


gt &ttcsccl.\. 


160 


tct ?tt^t.<?q» 




gtctgatctc 


tctttttttt 


<:fcf:ta£ratat 


cccaatagca 


240 


rrrtgcctgac 


taatacatca 


tgctct-r?<:l,g 


actccatatg 


tggctggttt 


c^t-tjqat^r.g 


SOO 


at tot gauti.* 




r.tct cacgct 


get veil; «i«(»<] 


&i':ataccaga 


gactgggtaa 


360 


1. 1 1 a t aa a ga 


a aa a gaggtt 


t n^bggactr. 


atgtgttcca 


cgtgcjctqi.^ 


<99g4£Ctcac 


420 




gaaggtgaag 


gaggagcscK? 


MQttACfitCt^ 


acatggtggc 


t gg o&v a a 


4H0 


gagaaagcat 


gttc-agggraa 


Hf:l'.CCCCttC 


ata-aaacc^t 


cm y ate teat 


gagacttatg 


540 


C&i-tat.Caca 


agaacagcac 


ca<ra# ? l*a r. c 


caccctcatg 


attcaattofcc 


ixl.cccactgg 


600 


qtccctcccg tgaoacatg^i 


ijAattatggg 


a get acquit I 


^yagatttgg 


gtggggacoc 


650 




t.Attragatgg 


ct tattt^r/f- 


ttctatgtag 


aacctctgd; 


M*. (.cat tea a 


720 


CA</r.cttcat 


ttagccacag 


at ^A£/ctctg 


tccctaactt 


^C^T.qatgg 


aa tgtacaca 


760 



l:aAgaaactt ccactgatgg aot gaaccu.-o r^»i>yi.<ji^i. fr^rggg/ia^a aaaotggcct 640 
^aatotn^qc tqinil. I. gtctgcagtc agtttgaatg gctgctcctt atgggnatna 900 
tttacattct caataaaott ct ctoq L U.i.i*:tgAttg a 1 1 r.ta a tg a gctttaaagc 360 
r.rtlt^Ol.ag* agatc-cccca gctgatagtc agccttg^gc- citgrjcat tc*on <jiir;t.Ut.K.&wc IQ^U 
caatcttgca acaagtt-oa qumjAl'.atr.rr t.ttattgggt. ccaatctaac caaaattatt 1080 
l.L.i.r.tatgtt ctccccagta acgtgtcaft at t ctz\ i.i ci gas r;t b t.qq r; M* ij <:i. LAuAgyi-rr: 11 on 
aanttt ngc«] 9i?t;ir.:t.q«n» t.tt.tat tttc ttttacacca ctxtccagca tgttacctga 1200 
tcagttgttt attatctttg ctgtt ijci^L-r; qagl.ga r.<r:A*. tottaagggc-c cgaggcagga 126D 
i]r^;<:i^ggca r;Ag<:ggaAac tctcccaaag accaggatct ttgttttgtt ccct rjcicnt & 1320 
tgctgagcnc cag gad t t its fin* At.tfA aacaaattgt gaggctttaa agagccgaaa 13 BO 
tatttaaaca ctgggcacaa ggttgttgct taatcugtgc t a yea t crjt t c* £CL.ccc&£l'.i* 14^0 
gtgtccaggt cgacttgttu r;l.q u a gt.t a a accacttgct gatcctcaaa caactagtta 1SD0 
gl.ggcacagc caggoctagg accccagtct ctactgttcc a n ct a accca t t.c-q c*? q 1560 
ggagcaottt gyntggtctc M.Attf.taaa aaaattaaat taaaattgtc tatttattta 1620 
gagacagagt cttactctgt agcccoggct r.gemt.tjc*iq I- qQ l£i-a»lu^* tagcrtcactg 16 BO 
Uaacctcrcat a K. act. ggcc:t. caaaaagtgt ttgaattaca gntgegagge act ntat?crtq 17 4 0 
geccgaatgt t ctgtt cage* canagcu*™: r.^taagt.rgi;: tgtgggg ccc cagacaagtg 1B0O 
ejf.t.tttgagg riqtc.r.i:t.atc. tataggaaca aagtaattaa aaeaaatgtat tt^igeiattt 1U&0 
acaggcccat gtgagatatg atttttttua aUqv*q«U.l. * qagt t: gg gtaaaaaaga 192Q 
gqtatttgm ii)l.lvU?htq« lit-.gttcagtc agtgaatgta cagcttctgc ctcatiitccii 19B0 
ggcaccatct cttcctgctc tttgttnU.a aatgttccat ^cctgggtaa tt.tcat.gtct 204O 
gccatcgtgg atatgeegtg gctccttgan cctgcttfjty t tq a a a gq a t crt t rrctt c 2iQ0 
ctgtccc^tc agtgccctan 1 a rcn t gta t ttaaggctgg acacatcacB actcccaacc 2160 
tgcctcaccc actgcgtcac ttgtgatcac tggctYctgg cqei^tcti-^i- < : a a ggf.crr. c. t. 2220 
gtcatgccct gttataacga Dtac*u>W.|C aagtcrttaoc tataggaaaa taagaattat 2280 
asccctttta ctggtcatgt gaaacttacc atttgcacitt tqi.a<^g^Ai: aaacacagaa 2240 
cagcacatot ttcaaxgect gcciU-ctgaa ggcattttg^ ttgtgtcttt onatc?tg^c;t 2400 
gtgctattgt tggtgtttaa cagtctoo^c agctacactg gaaacttcca gaaggcactt 2 4 GO 
ttc^ct-iqct tntytritttt i^ccagtgtc tatiagaggc ctttqaac-^g ggtaggctct 2520 
ttggagcagc tgaaggtcac acatccColq agngggcagc agggtcagaa gtggcccccg 2580 
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tgttgcctaa gcaagactct cccctgccct ct gccct c-t g bacctcc-ggc ctgcatgtcc 26<0 
ctgtggcctc- c T.gggggUa f: aictcccggg gctgggtcag aaggcctggg tggttggcct 2700 
caggctgtca cacacctagg gagatgctcc cgtttct^^q asur.^t t ggcc ccgnctcctg 2760 
caaacttcgg ^aaar.gf.g-a actcgaccct gcaccggctc acictgitca gcagtgaaac- 2B20 
tc-gcatcga icactoagac ttcctiv?^» l ? » 1 ?<i t w j u tjt q;ic;tqtcg cttctgcicat 2B90 
ctgtccttct ggccagcctg rggtctggcc aagrgatgta accctcctct ccagcctgtg 2940 
<:ac:Ag r:l.iai)<j»*;:.H<j i^l.^catccco acccct cage taraaatagg gcatcgtgai-: 50 DO 

ccggccaggg gaagaagctg ccgttqttr.:t ij^q twUcy tji^yc^ngh y iiqr^r-qijrjqgo 30 00 
ccccrtcagct c^ttcrtcggc cttgtctctc tcagatgtaa ccgagctgtg ggctaggagg 3120 
a ^fcijg 1.^:13 gnctg gagged oggl.gAt.gA~ l:gaa*ia»K:f: I. I. trial: ft I. <: 1 . 1 - A P. h *g a i.^ fj 33 HH 
tcatccggac gcgggctttc ccccacrtcgg tgcccacctg gggtcttaca ggoggagctg 3240 
nt.crftt^ftl*.*:* qi^^i^g'.* 0 "- A^gatggtca ggtcttcctg tt. ecg cf.gag aaaagttagg 3 3 DO 
gtrcctcagga acggagcaga ctggtacagg aacsgagtca tcatggccaa gagtccaccg 33G0 
ggtcotcl.ti} ravwiirjflu q^at.agcagg gcrtr.gtgcag gaar.tggggr: r.ggagggaag 5420 
ggccgggctc ggtcagtctc cagctgggat ccccagagtg gtcarcctac ccctccctcg 34 80 
agacragar.tg ftf:U\£c.LgU* l;gr.catcagg crtggtcactrg tetccctgaa cctcgatttg 3540 
ctcacctata aaatggaact aataacgatg cctgggctcc ctgtctcagg ggctotggta 3600 
tagctgaaga gaarl..aa i.al. aacatgaaag tgctttctaa gctttgggat aagctaaaag 3660 
gcagattcca attttattcg ogggc&ncqt aqottyritnc ttcogctcgr ggatqncagn 3720 
gtcagggggc ctggtt.ct.ga gtcctagttc tgtctcttcc cagctgtgtg acgttgaaca 3780 
#g ct gg n c-ctctctgtt cr?.t-^*l.ii^^ MM w i :?i I ,q y k iU- L 1. CzA I . 1. &H ft La c T. 1: C L 3tt40 
ccaggatgca gtaggtccrca gggactatcc Caggaatgtg ggctgtatta gtaaacacaa 3900 
C.ag^gqgc7ac cctgttc-cgg i^nLcacstt cracatcagag craaacagaca aa gacgctgg 3960 
acagaataag tgcataacta catggtacag ogggttataa ggagggaaaa ggggagctgg -4 020 
augatid^ijt tgagcigtijcc ixggtgtggtg gggaaagctg ragggtgaaa tactgcatca 4 080 
gggaaacctc agggaaggtg aggactatgg tgaggtcnga ggggitgata t g aq-* t 4140 
gountgonoio tggcaggcac tv>oaggagca tgagccgt^a t ctx cracct t tagcattcag 4200 
cccgggagaa gtagggagac atagaagggg caggtgctgg ccua gngg g^gcaggar* 
aggctgaaggc ggaggggcac to^ggg^gag ggtgtcaggc ccgccacccc agagcaccat 4 320 
tactcrcagg acgnggctgc gtgcagacct gganccagcc taggn?«gc*n* ccqftAtjatoa 4 3fl0 



rv o ULii > i i'^i 



caactgagaa caoacgacns ti;l.<:vgccrc 
gncquUy:ftl: gagcaggagc agcacagtga 
c. c ccj a a ca gt i g gc g ct t gt caq r.*c*<3 n * < : 
ctgt ttj.?i^l. i^tr.tgargt gactgtctag 
ctgcacagaa agnr-otqco^ ^^ycagaga 
tggctcttcc tgagaacagc ggntggngtu 
ctgagqtocq nn\. 1. 1.<jyi:/»l. ttgaatgact 
gcagrtgcac cgtgttgagg gcnygtgcLu 
ccc»o?ti*l.Aiiy ftA^Aat.gA^t. gggtatgctt 
tggccttgag tcagtgatga c.uc<3ut<K:*y 
gacagaaaag gccaagaagt gacgatattq 
ag#t(;t.<3i)LA L^^rt^l^lA.A gaattgaatg 
tcaagaagag tcagatgaaa t goat tag oc» 
taaq^tago tttcattt.gt agcagctgag 
cgtggtgctg gacccgtgtc tagggcs^q 
ur-u^cfaMX l:lr.i:iar.i*.»vl:r. tr.tttttt 

<210> 2 
<?.ll> 115$ 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eatuxe 
<222> 1151) . . (10D9) 

<400> 2 

^^UcaUU: I.L.eU.ttl'.l-l. tgtagacatg 
gggaatattg agacaatgaa a gat ct.$g i I. 
C<:V.tl:caccc tM.<jr.jiAAftt. gatatttcag 
atggctcctg ccggtgtgag cctqa^qi-C 
ctggctqrifiq. q-f.gAm^ggt. gtacatacac 
acctgtgagc agctggcaaa ggcua^fg^o 
<jetc<^«ttc- ^i?^i--w^<3»4c: atcccctgtg 
ct agt cgct g caa a actT ga cact^^* f i * f: 



4 

aaaaatggcc catggaattg crjtctctg^c* 4440 

gcgggct<jtv. I, cr^fc i^c:a<j r:g crcatccaaac 45Q0 

l: \\<\ c:i:agcag tcggttccca caggtttccc -4560 

attaggtgtg n^o^ ii<jti*ii<: I. iraggottot 4 £20 

gagttttctg ttccattttt ccntgt cat g 4 680 

vwlQCftl.qq ggagt.qgggt gagatggtag 4 740 

gaagcagaac aa a acacacc oqrrtouttua *H00 

q 1 1. Ar: gggt. fi tgggtgaggg aagccagctg 4 8 GO 

agatgaagca giincuuitota <3i^Al.i^AA<jg 4 $20 

rcl.fccaa.ggaa gcctggcc^a gcccxggggg 4980 

c;?icjLmc-.^i;i:c: i^rce- L c-cac a a gaaatgagtg &04 0 

aatcaataga ataaacgt+c a t.vv-fct I. * Si UN 

^q^i^A^iu^: aagaacctct tctgggggtc 5160 

gcggaaccag ct gt u ag<i i-c t Li;*jagagaoi 522 D 

LjKl,ixhAa»»i^: rct.gcrttsrat a tea gag-tea 52 80 

530* 



aatggattta tvetgggota .™i<3gl.gaca 60 

ejcj^r.ggcarcr. taaaqgtcag ttaataacca 120 

ggtatgogga arjtr.yvntr.vi^ ccagtctgag ISO 

aocatcctct gcctcctggc ctgggctggc 24 0 

cccrttccncc t.r.:c]Li^l.r:M caa^gagagt 300 

gggaagecca aagaccccac ctt catac^t 360 

gatgaansigq i.x^.acAgga ccagctggtg 420 

aagt.tgaagg ccgcaatggt egggcatyctg 4130 
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(jcc»c»c v^go^i. r.ccg tatetiatggc 
ggggccaccg tcctctccuv xHtxiqatqtc 
gc.cttggacc scacagctgs cagq i-tv u» < i 
a o ct g ca t sitf l • <j f-gcg caca ag 

ctgctagtgg cccagggcay w<:ii)ALa§n 
ggwjt<nUU» wqrj^.cagq cctgcacctg 
tataccccrg tgglcctccc c*cnr-U:i^l-y 
%».7i4&l.l:gac:a ggttcatgca ggctgtgacci 
gocagtqtqq -» r *< '■ » '- ggctt-tcaac 
or.ctctgctg gctctggccc taggadtt iirj 
qaf.gggcagg gggcagccct 
gtggggaagt ggcctctcc 

<210> 3 
<211> 5G8 
<212> 7>wn 

'.-.213? Homo sapiens 
<220:» 

<222> (151) . . M1B) 

<223> exon 3 of AGT gene 

<41U» :* 

iK-^ggagaat. g^agagt.ggc stgggggggg 
gca gg at ct g ggcot 1 1 g gt g t*: I .gc:ca gc 
ttccctccct gtctcctgtc tggaVttcac* 
o gcc i±r& i|tj-^ ijl.l.iil.<i<jMi.q gacaacagca 
tgggcacctt ccagcactgg □ gtq a m t c;c 
tcactga5ur> cquctqcrl-.q i:tgctgatcc 
tggagggtct cactttccag caam<cl,fcfc^ 
agcjcKicctc"^ uii^t.ut.^cin- tggaatgtgg 
ggggtgggga gtagacacac ct^Agcrtgag 
tggct»c<3"jtj<5 iivqqqctAqqc: ggcctgtg 

<210> 4 
<211> 445 



5 

atgcacngtg agctatgggg cgt rjqt cwt 5*0 
tttggcaccc tggcctctct ctatctggga 000 
i4<:pj»l.iutl.Qy g L. gl I. f. 1.1- t.g gaaggacaag 660 
gtcctgtctg ccctgcaggc tgzacagggc "?20 
caggcccagc tgctgctgtc cacggtggrg 780 
aagcagccgt ttgtgcaggg cLvtqrit-.tul.c 
^fti*:l:ti:«icag aactggatgt tgctgctgag 300 
9 gcit ijgu ei gc» u t-<i<i i *i.< j uir: < : r. q a aq gg* "960 
acctacgtcc actt.ccaagg tnogrjcanac 1020 
\<! C)1.ACji a :l:gftgfii r.cagccagtc 10 80 

gcggacntac otyytgoccci w-^tl.^AtjAri 1140 

1159 



ucpo n tjc? n i jtj a i. t$Ag g ^gggcrgaggg 60 
cctcattcct gcccctgtct tgggrgaeto 120 
ggaagAt.gaa gggcttctcc crgcxggccg 180 
cctcagtgtc t rjt t unc* tq <:W-dtQqc:& 24 il 
aggacaactt ctcggtgact caagtgccct 300 
agcctcactn tgcutctncK: utg<3a^aegg 360 
tcaactggar gaagaaactg tcucccoggt A 20 
gogccacact qUmi.qatxiA ggtrtggggge 4 80 
ccttgggtgc agaqcagggu. ytaqg^ogitg^ 5*0 

568 
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<212> DKA 
<220> 

<221> mice future 
<?,?.?.> (IhT ] . . <2$h) 
<223> ex on d cf A til* gene 

■c40D> 1 

cccaaggggg aggatgactt agctgggtga tggggggagc aaacatccct gtcgtsaggg 60 

t t<iqnt.ciw i*«jMi}:*-Ml.wdQ ^:T.gc-r;r.ggc: ctctgggaga gcc<:t.r:aotg t.gtggcctgg 12l> 

agccttccta sctgtgcatc atctccccwj iiac^ttcGiir.i <zxtivvrdtnv wx-oci &t qqt IPO 

^<;Lt^:ftaggiU f.r.tt atgaotr tgeaggaect gctcgcccag getgagctge ccgccattct 24 0 

gcacaccg<ng crgoacctgo azidtKitbntiq ra« lc)*i;i;i)c: * l.iviy tjqUjy yoqaQtj I.aI.ij 

L.ql.gagcr.i.g t.qt ^f.gtgc-c: tgacctgggt tccaagtgtg cacagggrgg gaggcatgga 360 

Trjt^ciriqucn: « r::.?q *<i I. a '.. gg gt g<g ggtcagc-agg gcaagaggga gcggagagta 420 

gggecaaagg tgggagagaa gtagc 4 45 

<210> & 
<211> 1091 
<212> DNA 

<213> Etoira? .^p i.on* 
<22D> 

<221> jnisc_f%atur© 
<223> exon 5 of ACT g&ne 



actgtgcatc 


ctcctggcag 


gctggaaaat 


gtccccct oo 


aggacogtgc 




&0 


ggtcctgagc 


ccoccccggc 




ttz&qii&ar.r.t 


gggtcaccca 


gtgcgccc-tc 


120 


agaatgatcc 


tgatgtctgc 


tgctttgcag 


gtgetgoaen 


gentt t tttt 




n*o 


gegg-atgaga 


gagagccc-nc 




caacagctta 


acraagectga 


ggtcttggag 


240 


gtgaccctga 


accgcccatt 


cctgtttgct 


gtgtatgotc 


onagcgccac 




300 


tt cctgggcc 




mC^ get gage 


acagcatgag 


gccagggccc 


cagaacacag 


36D 


tgcctggcaa 


ggcctctgcc 


cctggccttt 




uca <j go a g 


ataacaaccc 


42IJ 


og gotca aat c 




CA-rrccccagt 


ctcccacctt 


ttcttctaat 


gag-ccgnutt 


480 


t.gagctggaa 


ageagcegtt 


tctcottg^t 






gagcagtaga 


54 0 


agcctgcagc 


ggcacaaatg 




tttgctgggt 


ttattttaga 


gaatgggggt 


600 


^gggaggr;a<* 


gaaLTf:agtgt. 


ttagcgcggg 


act act gttc 


i;tartrfrt Ag^a t. 


tccaaccgac 


660 


cagctt gttt 


gtgaoocaa-n 


<?£agtgttcc 


cttttcaagt 


tgagaacaaa 




120 
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1 

x initio tutu*** c\\^\.n<\^\A,\. l.LtjL':Af„vgcc ttcggtttgr atttagtgtc ttgaatgtaa 760 

gaacatgacc tccgtgtagi gtctgtaata ccttagtttt ttccneagot r^ttcitgait 840 

\.\\A±f*Hr.<iH\. (ji^-cjaAA^ia tgcaagcacc tgaatxtctg tttgaatgcg gaaccatagc <>00 

tggttatttc T.cccttgtgt tngtaatcaoy cqtcttrjci^i i;* .<» L * .?q i;*,-: I . uwati^Lt t }(i0 

tatctitcat ttagcagcca aacagatgta tacaattcag cagatagact gtgcaaacga 1020 

aagtgctttc ctggacttcg ciatgqtinttt c w tq w *w i ^Ujmji^Kq l./»<:M^<jiXacj 10*JO 
tectttgaag t 1091 

<210> & 

<211> 2099 

<?.}.?.> DNA 

<2l%> Homo sapiens 

<2<>0> 

<221> CDJ3 

<222> M01 - . VLt$4) 

<22V> 

<22'1> Riq^pepr.i rte 
<222> MD>- . (138) 

<4D0> 6 

aagaagctgc cgt.r.gr.l.i^r.g ggt a eta cag cagaagggt atg egg aag cga gca 54 

Met Arq Lys Axq Ala 
1 5 

cere cag tct gag al.g g^r. cct gec ggt gtg age ctg agg gec acc ate 102 
Pro Gin Sex Glu Met Ala L'ro Ala Gly Val Sex Uu Arg Ala Thr lie 
10 1!> 20 

etc tgc ct^ ct.g grrt: egg get ggc ctg get gca ggt gac egg gxg tac 150 
Leu Cys leu Leu Ala 'Jxp Ala Gly Leu Ala Ala Cly Asp Arg Vnl Tyr 

25 3Q :*.!> 

ata cac ccc ttc cac etc gtc ate cac aat gag agt arc tgt gng ong 198 
Ha Hi 's Pro Phe His Leu Tie Hi« Aa™ Gl u TJix- CyA Glu Gin 

4 0 4 5 50 

ctg gca aag gec aat gec ggg a*>» qc?u vi;r.: c?i;c ti-^: Oil. 246 

leu Ala l.y$ Ai* Ayn A1a Gly lys Pro Lys Asp Fro Thr Phe lie Pro 
55 60 65 

get cca att cag gec aag aca tec cct gtg gat gnu. ang ijee ets Dfi? 294 
Al a Pro lie Cln Ala Lys T^* ^er Pro Val Asp Glu lys Ala leu Gin 
70 Sh BO 95 

gac cag ctg gtg eta gtc get i%c^ m*.* t:fcl". q««: ai^: gaA gac aag ttg 3d 2 
»-*;e> G*n Leu Val Leu Val Ma Ala Lys Leu Aep Thr Glu Asp lys Leu 
90 95 100 



aag gee gca atg gtc ggg ctt<j czt.Q ycc aac. tr.c ttg ggc ttc cgt ata 
Ly$ A3* Ala Met VllI GJ.y M^t Leu Ala Asn fc^e Leo Gly Phe Arg He 
10b 110 115 



390 



8 



t.^t tj<gi-. al.Q agt gag eta tgg ggc gtg gtc cat ggg gec acc gl.<: <:iH 

Tyr- Gly Met rtis Sex Glu Leu Trp Gly Val Val His Gly Ala Thr val 

120 3^.S i.in 

cic tec cc3 acq get gtc ttt ggc ace ctg gec tct etc tat ctg gga 48b 

Leu Scr Pru Thr Aljy V« I Phfc G.ly Thr l.yg ftor l.**u Tyr* T>*-u y 

tiO las 

gee ttg gac cac aca get gnc ngg eta cog gca ate ctg ggr gtt cct 534 

Ala Leu Asp Hi& T;jr ?T.ti At;p Ar<j l.vu Gin TV L*j Tl<; Lcu GJ y Val Pro 

1.M 15S 160 165 

tgg aag gac sag aac tgc ace tec egg ctg gac gcg cac aag gtc ctg 582 

Trp Lys Asp Lys A.^tl Cy* Thr Pkj- A?h T.<hi At;p 7M « Wiy T,y*; V^il Leu 
170 175 180 

tct gec ctg cog got gtc* euq 7ns ctq ctn qti Qce carr qye cun get 630 

S*r Ala Lftu Gin Ala Val Gin Gly Leu Leu Val Ala Gin Gly Arg Ala. 

IBS 1 90 

qat ^i?c e)<:i: <:^lj r.l.g i^l.g I:ci^ aiu> gLi> gl.Q gl.t> LCl: a*:* 6"7|J 

Asp sar Gin Ala Gin lieo leu Leu Ser Thr Val Val Gly Val Phe Thx 

200 205 210 

er-^ i^jii cl-y c»v i-l-Q cag ccg tt.t gcg cag ggti ctg get ntr. 726 

Ala Pro Gly his Jbeu lys Gin Pro Phe Val Gin Gly Leu Ala Leu 

215 220 225 

t.Al col. <j\.g <alr: m.c cca cgc tct ctg gac ttc aca gaa ctg gat 774 

Tyx Thx i'xo Val Val .Leu Pro Axg Ser Leu Aap Phe Thr Glu Leo Asp 

230 235 5 

gtt get get gag aag att gac agg ttc atg cag get gtg aca gga tgg 822 

Val Ala Ala Glu l*ys He Asp Axg Phe Met Gin Ala Val Thr Cly Trp 
250 Zbb 260 

aag act ggc tgc tec ctg acg gga gec agt gtg gac age ace ctg gci 870 

Lys Thr Gly Cys Ser Leu Thr Cly Ala Ser Val A^p -Scr Thr Leu Ala 

2 65 2'*Q 275 

ttc aac acc tac gtc cac ttc caa ggg aag atg aag ggc ttc tec ctg 31 B 

Phe Asn Thx Tyx Val Bis Phe Cln Cly lys Met lys Cly Phe Ser Leu 

28D 28S 290 

ctg gec gag ecc cag gag ttc tgg gtg gac aac age acc tea gtg tct 966 

Leu Ala €lu Pro Gin Clu Plic Trf> Vyl A-^i &*r Thr VaJ. S<y.r 
£9b 305 

gtt ccc atg etc tct ggc atg ggc ncc ttc cag cue tgg arjt qsc «tc 1014 

Val Pro Met Txm Sex Gly M^t Gly Thx Pb* Gin His Trp S&r Asp He 

33 0 34. S 320 325 

cag gac aac ttc teg gtg act oci^ nty coc ttc wr-t y»c? »nn <ji^: 10&2 

Glrs Asp Asn Phe Ser Val Thr Gin VaI Pro Phe Thr Glu E&r Ala Cys 
33H 335 340 

ctg ctg ctg ate cag cct cac tnt qcc tct rj^i; i;l.<j cjai: ftrt^ gtg ga^ 11)0 

Leu LF?ut Leu lie Gin Pro Y**& Tyr Ala Ser Asp Leu Asp l^ys Val Glu 

345 350 355 
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ggt etc act ttc cag caa aac tec etc aac tgg atg aag aaa ctg tci 1158 

Gly Leu Thr The Gls Gin Asn Ser Lsju Asn Tip Mot Lys Lys Leu Ser 

■ISO 3^ :*'/n 

ccc egg acc ttc cac ctg acc atg ccc caa ctg gtg ctg caa gga tc: 1206 

Pro Arg Thr He His Len Tbx M^t Pro Cln leu Veil Lc\i Cln CLy Ser 
37.^ 

tat gsic ctg cng gac ctg etc qcc cur? qnt noq ctq cc-c ncn cttt c*rj 1254 

Tyr A±p Lew gi.'j A«;p Leu Leu Ala Gin Ala Glu Leu rro Ala He Leu 

390 " 335 400 403 

cac i)Afj o.l.q r:I.C| AAA t.1:g age: a*f. g^r aqn ai.fi- rtfjg <jti? 130? 

Jlis Thx Glu Leu Leu (Sir. Liys Leu Ser Asn Asp Axg lie Arg Val 

>| 10 415 420 

qrjq r^i] rjl.i* i;.lq oac* aqu W: I. CjAq fit 1. g*a ^r.rj <ja<: gai} a>^ 1350 

Gly Glu val Leu Asn J3er lie Phe L'he Glu Leu G la Ala Asp Glu Arg 

425 430 435 

gag 1 enc: aviA q*q l.r.i. r*n^ caa cag ctt aac aag cert gag gtc ttg gag 

Glu fro rhr Glu Ser Thr Gin Gin Leu Asn Lys Pxo Glu Val Leu Glu 

440 445 45" 

gtg acc ctg aac cqi': cca ttc ctg ttt get gtg tat gat caa age gee ld$£ 

Val Thr leu Asn Arg Pro Phe leu Ph* Aln Val Tyr Asp Cln Sor Ala 

455 4 60 4 65 

act gec ctg cdf: l.l.f: ntg ggc cgc gtg gee aac ccg ctg age aca gca 1494 

Thr Ala Leu His l*he i>eu Gly Arg Val ALa Asn Pro Leu Ser Thr Ala 

47Q 475 455 

tgaggecagg gceccagaac acagtgcctg gcaaggcctc tgcccctggc ctttgaggca 1554 

aaggccagi^a goai^tfA^a acnccggaca aatcagegat gtgtcacccc cagtctccca 161 a 

ccttttcttc taatgagtcg actttgagct qcisoagcagc cnjtttctcct tgrjtotaagt 1674 

gtgctqcatg g^ gt.<a & q. r.agaagcctg cagcggcaca aatgcacctc ecagtttget 1734 

gggtttattt tagagaatgg gggtggggag gcaognaocii qt-cr^-t ^qqaotact 17 94 

gttccaaaaa gaai.ti?C^Aix ^gaccagctt gtttgtgaaa caaaaaagtg txcccttttc 1854 

aagttgagan castaaattgg gtttt zxsaot ttftteijtotnc »tM:'..t.c)wt I; m't^t-tegg t 1914 

ttgtatttag t gt. r. M: m I. gtaagaacat gacctccgtg tagtgtctgt aataccttag 1974 

ttttttccac ogatgettgt gntttttgcia uc»c?t9cqt n<? » w ^ ft 1 - * « kqe.^tt 2034 

tctgtttgaa tgcgg^AOCA tagctggtta tttctccctt gtgttagtaa taaaegtett 2094 

gecac ^099 

<210> 7 
<2U> 4P5 
<212> PRT 

<213> howo sapiens 



10 



ojD0> 1 

Met 7\rq Lyft Arcj Ala t^ro Gin Ser Clu Met P>*> Aia Gly Val 5ex 

Ar^ Ala Thr lie i,eu Cys Leu Leu Ala Trp Aia Giy L*u Ala Ala 
20 ^ 30 

Gly Asp ?\r-<3 VAI Tyi lie Hie i'xo Phe His leu V a l 13?. Hi* 7\nn GUj 
3D 4 0 -aS 

fcer Thr Cys Clu Gin i,au Ala Lys Ala Asn Ala Gly Lys Pro Lys A:*p 
511 GO 

Fro Thr Phs Ho Pro AJ.a Pn> 11* Gin Ala Lys Thx Sex Pro Val A3p 
65 ?0 75 » n 

GLu Lys Alo Un« G1 ■> A«P Gin Leu Val Leu Val Ala Ala Lys Leu Aj?t> 
B5 90 « 

rhr Glu Asp Lys Leu Lyx K\v Ala ttet Val Gly t*et Leu< Ala Asn Phe 
1IU> L05 13 0 

l,*u Gly The Ar 9 Lie Tyr Cly Met Hi* Glu Leu Txp Cily Val Val 

1L5 120 125 

His Gly Ala Thr, Val Leu ser Pro Thr Ala Val E'hc GHy Thr L*u Ala 
130 L35 140 

Sor Ixsi> Tyr Gly Ala Leu Aap His Thx Ala Asp Ar^ Uig Gi n Ala 

IdS 150 15b 160 

lie Leu Gly Vfl]. Pro T*p Lys Asp Lya Asn Cys Thx Sex Aril Leu a*f-> 
16* 170 1?* 

Ala His Lys Val Leu Ser Alo Gin Ala Val Gin Gly Leu Leu Val 

1P0 1B5> 190 

Ala Gin Gly Arg Ala Asp £er Clx» M» I-*i> Leu Leu .Ser Thr Val 

195 200 205 

Val Gly Val Phe rhr Ala Pro Cly Leu R:i* T. w Lys Gin Pro Phe Val 
210 21b 220 

Gin Gly Leo Ala Leu Tyr Thr Pro Vyj Val l.*o Pro Arg Ser Leu Abp 
225 230 235 240 

Phe Thr <3lu Leu Asp Val Ala Ala Clu Lys Il« Asp Arg Phe Met Gin 
24b ?-50 25 D 

Ala Val Thx Gly Trp Lys Thr Gly UyB Sex Leu Thr Gly Ala Ser Val 
260 265 270 

Asp sex Thr Leu Ala L>he Asn Thr Tyr Val 33ts The Gin Gly Lys Met 
275 280 285 

I.yft Gly Phe ser Leu Leu Ala Civ Pri? Gin Glu P/je Trp Val Aap Asn 
2S0 295 300 

Ser Thr Sur V tt .[ .5<*r; Val Pro Met Leu 5c r Gly Wei- Gly Thr Phe Gin 

305 aio 320 



WO0Q/7t751 



PCT/USO 0/1 3327 



11 



Hi* Tip Lfer Asp lie Gin Asp Asr. Pho Sex Vol Tlir G.ln I Pm Phc 
32 5 :*:jn 13 b 

Thr Glu ser Ala Cys Leu Leu Leu lie Cln Pxo Hi* Tyr Al# f?nr Asp 

Lev A-t;p T-yrt VaJ Glu Gly Leu Thr Phe Gin Gin Asn Sox Leu Aisn Trp 
3DI> 360 :*f>^ 

Met Ly^ J.y.r. Sfcr Pro Axg Thr lie his Leu Thx Met Pro- Gin Leu 

370 375> 3«H 

Val Leu Cln Cly Scr Tyr Asp Leu Glu A&p Leu Leu Ala Gin Ala Glu 
385 390 395 400 

Leu Pro Ala He Leu Hix Thr Glu Leu A<tn Leu Gin Lys Leu h.er Asn 
fll}J> -110 415 

Asp Axg lie Arg val Cly Cln VcO l.«u A*;u lift The cslu Leu 

420 425 430 

Glu Ala Asp Glu Arg Cl\i Pxo Thr RJ.u S*-r Thr Gin Gin Leu Asn Lye 
43.5 4-10 445 

Pro Glu Val Leu Glu Val Thr Lou Aon Arg Pro Ph« l.vu Ph* Ala Val 

450 455 4€0 

Tv A.^]> Glu Ala Thr Ala Leu J3i3 Phe Leu Cly Az<j Vol Ala A«n 

46* 410 475 «60 

Pxo Loi> 5ur Thr Ala 
485 



<210> S 

<211> GB1 

<212> DMA 

< 2 1 3> H onto *c*p i en* 
<220> 

<221> mis cofeature 

<22Z> (151)., (531) 

<223> alterjar.ft exon 1 of AGT gene 

<400> C 

agggegggga aggggggagg aa-gggggnct gesgautetw tQLll.frta I: a t.g^tatgatg 60 

ggatttg-atx: tyutictt t-yeti <sctttcctca aacacaaggc acaaacetgc ccc^tjcO^dV. 1X0 

catcrct.cnca ca gaggcart gtgaatgtat ctaagctaacjqi;. tytt.gct.ea.gA aacgtgctgc 180 

aggggaggct ggcacattct g etc outage taaccgcaca ggcccatggg tccacacac* 240 

tcagaccatg gctctctgtn gtctatcttt gecattaett tanr*c^r?t^:l-. £t:ttct.catg 300 

cgctcatgga gggtgactga agat rjL*:4C^ ocgaggtgca ccctcagcct cgctggacac 360 

ttncqqot^ c-yt-tr^t c^ag ccggaccagg cn gt t rj<»q<q n *<l t- l: i:ga a gtccagctgg 420 
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agctcctcat r.ci)C* L ar.it I. gaqrracaatc agctgcagga g g t ggc! get cagctgaaag 480 

r.iM.Qr.gggga caaaggagga gacccag^r:. li^<H.^ l.al.i*: r.gag^cr;cga ggtgagtctg MO 

cacctt bijrtij ^tjAqc-tccca ggcagctgag gagagggcrc fniwi^ijya qunoot.cjl; 1 1- t>0U 

cactcctcoii aqc-~()<]i)L)l.i: <"i:cagacc^n gtgggactgc zgcscttggt gtgantgcag 660 
i:ql.Qagagtg tgaatctgtg g 661 

<21U> 9 
<21i> IB 
<212> DNA 

<213> Arl.il ir.iAl Sequence 
<220> 

<223> Description *>! Artificial Sequence :M13 forward 
primex 

<4D0> 9 

<210> 10 
<:ni> 1.9 
<212> t>y\h 

<213> Artificial Sequence 
<220> 

<223> Description of Art.! li.fcial Sequence tM\ 3 reverse 
i infer 

<4 0D> ID 

aggaaacages tattjacnat 19 

<210> 11 

<an> 20 

<212> DNA 

<213> Homo sapiens 

<4A&> 11 

cctcgaagtg gcctatggaa 21) 

<2LG> 12 
•C2L1> 2Q 
<212> DNA 

<223> Homo sapiens 
<400> 12 

gegagactet atgggtgatg 2D 

<210> 13 

<211> 20 

<2l2.> DNA 

<213> tiomD sapiens 
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<40ti> 13 

gactgogncl vLw:l.gt:tt.c 



<2)0> 14 
<211> 19 

<21'A> Homo sapiens 
<&,OD> 14 

i -#gq gtggca t c 1 1 cag t c 



<210> 15 
<211> 20 
<21Z> HNn 

<213> Bono sapiens 
<4 00> 1* 

t ci9c«i.:Al.l:r: T.grrtocttag 



<210> IS 
<211> 20 
<2I2> UNA 

<4 00> 16 

at acagcga rj t ttct 3<i y to I*. 



<210> 17 
<211> 20 

<213> floino 
t$ac crag g cag 



sapiens 
ttgaggaagt 



<2HO> 1$ 
<211> 20 
<212> DNA 

<400> IB 

gattcacact ot r>a uyustyo 



<2L0> Id 
<211> 20 
<22^> 0M7\ 
c2l3> >lomo 

<400> 19 



sapiens 



<210> 20 
<2ll> 20 



14 

<2L2> DNA 

<213> Hrjiiii) .sapiens 
ctgctattur: vtjnt.gatggc 



T210> 21 
<:213> «0 
<2l2> UNA 

<213> Homo snpir.jitt 
<4CW> 21 

ariaggcccat gtgagatatg 



<ZU1> 22 
<211> 20 
<212> on* 
<213> Komo 

<4 00^ '22 
t<agtattagg 



sapiens 
gcactgaagg 



<210> 23 
<211> 2Q 
<212> DtfA 

<213> Homo i-apicn;* 
<dQ0> 23 

tgtgttgaog cng^^t^tt^ 



<21D> 24 

<213> 2C1 

<212> DNA 

■C213> homo sapiens 

<:4 0O> 24 

<:210> 25 
<211> 2L 
<212> I>NA 
<2l3> «<xmo 

<400> 25 

<220> 26 

<211> 20 

<212> DNA, 

<213> Homo sapiena 

<400> 26 
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<210> 21 
<2'12> UNA 

<213> Bvsnii Artpiens 
<4G0> 27 

cat gt coot g t<ji*<;i:l<: I r - 



<210> 28 
<211> 

<213> Homo sapiens 
<4 00> 

l:g^g<jggt gg ggatggag IB 



<2] D> 2$ 
<2il> 20 
c212> DMA 

<213> Homo Arift 
<4 00> 2 9 

ctggtcsat.<|t. ijAAfti.-ttacc 



<:2T1> 2fJ 
<212> DMA 

<213> Hanio ^picn.*; 
OJD£}> 30 

ace* tea get gctccaaag*! 20 



<210> 31 
<211> 20 
<212> DMA 

<213> Eiorno sapiens 
<-400> 31 

cc*c 1 1 I. r.i; a qc a t gt. t a cc 



<:210> 32 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<4G0> 32 

ctgctattcc tcctgatggc ^ 



<210> 33 
<21L> LB 
<212> D»A 



16 

<40Q> 33 

aaccctcctc tccsgcct 16 



<2lfJ> 34 
<211> 18 

<2i:-!> Homo Aftpiens 



<40Q> 34 

* r-c» * -c:^ i u*-<3 Q C:C t CC t C 



<40Q> 38 

L & gg qctctt.t. teat ccac a g 



<2L0> 39 
<211> 20 
<212> DNA 

<2L3> Homo e a pi ens 
<:400> 

as t geeggga a gecca a a ga 



is 



<210> 3.5 
<.2ll> 20 
<212> DNA 

<213> Homo vc>j>itfn$ 
<d00> 35 

tacttggact tt^gqetcjan -20 

<210> 36 
<211> 21 
<212> 

<21-i> Homo sapx&ns 



2) 



<400> 30 

ti^c<i>C3ag<i oairtttgttt c 

</A \\> :i? 
<2il> 21 
<2L2> DNA 

<2]3> Honin sapiens 
<4 0O> 37 

ggctaaatgg tgaca<igc}{i« l- 21 

<210> 38 
<211> 20 
<212> PW 

<213> Ham© sapiens 



20 



20 



<210> 
<211> 



40 
20 
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<2i2> JPWA 

<213> Memo ycinicrv.? 

^ccaaggctc ccnqntcKiag 2£i 



<210> 41 
<2D1> 
<2l2> DMA 

<213> Homo sapiens 
<4 00> 41 

i;l*.r.t.t:r:rrr:aa c ggc tgt czt t. 20 



<r.2l0> 42 
<211> 20 
<22Z> PNA 

<4 00> 4? 

<:l,i)CAr.fl£*tt qc*<:r.i)Cl.tC« 20 



<210> 43 

<2ll> 20 

<212> DNA 

<213> Homo sapiens 

<400> 43 

ctctatctgg gagccttgga 20 

<210> 44 

<211> 20 

<212.> DNA 

<213> Korea sapiftrt£ 

<400> 44 

eecacfrccl.qi^ t rjcxa ret gt 20 

<211> 20 

<212> DNA 

<2~1%> Romo sapiens 

<40D> 45 

tgaagcagee gtttgtgcag 20 



<210> 46 
<:211> 19 
<212> DNA 

<2L3> HOMO .';apj.etri^ 



<400> 46 



19 



18 



<210> 47 
<211> 20 

<213> Homo 

<400> flV 
wi^A^gtitcat 



gcaggctgtg 



<?.1\}> 4E 

<211> 19 

<212> DNA 

*'4 0O:> 4B 



<210> 43 
<2JJ> 51 
<212> Dtik 

<213> Homo sapiens 
<400> 4 5 

tgcctcagtg aar.tuttdal.Q <3 



<210> SO 
<211> 20 
<2*)2> DNA 

<223> Homo sapiens 
<4 00> 50 

Ct.r:cccrctcc aaaactacca 20 



<2l0> 51 
<21L> 1£ 
<212> DNA 
<Z1.'J> Hcwci 

<4O0> 01 
caggcigaatg 



ccagagt ggc 



<2L0> 52 
<2U> 19 
<212> DNft 
<213> Homo 

<JQ0> S2 
gcaggcgctc 



sapiens 
tcagtgaag 



<210> S3 
<211> 2D 
<212> DNA 
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<4IJU> 53 

aq ca ct gga g 1 ■ v» " £ cc a g 



20 



<210> 54 
<21l:> at: 

<213> Homo .'.uif= : iwift 
<4t>0.> 

gatggaggac tgcit.^vacag ^° 

<211> 20 
<2L2> PN* 

<2i:*> Jlcono sapiens 
<4 00> 55 

tfsi^jgcfgagg atgacttagt 20 

<2Jll> 
<2L1> 21 
<212> DNft 

<"213> Hoano plane 
<d00> 

gaastttcxvrt oi^agtcca g 

c21P> -S'/ 
<?.11> 20 
<212> DNA 

<213> flr^ino .^Apl-en^ 
<JQ0> !>7 

gtgcaaggag cotanguc.*!*} 20 

<210> SB 
<21L> 19 
c*212> DWl 

<2L3> homo sapiens 
<4 0O> 50 

ct.Actctccg ctccctctt 1<s 



<210> 59 
<2H> 20 
<212> DWA 

<213> Hocno sapiens 
<4fl£l> 59 

aagggggagg □ tga.cttpgt 20 



<220> 60 
<211> 21 



V¥U 1*1/7 J 725 J 



20 

<212> DNA 

<213> Homo sapiens 

*.at>0> 60 

gaaattccat ccaaagtocu <? 



<210> 61 
<21i> 17 

<*213> homo sapiens 
<4DD> 31 

t.a^jcrcctrcag caccctg 



<iJ0> 62 
<211> 19 
<212> DWA 

<^15? ]]omt> eapiexis 
*400> 62 

UlLCi-QgggT. tgt.tatctg 19 



<2]r» s:* 

<211> 20 

c212> DNA 

<213> Hoiou 

<aoo> 63 

agaacacagt 



goctggcacig 



*210> 64 
<211> 20 
<212> DNA 

<213> homo sapiens 
<40Q> 6-0 

ttcacaaaca agetggtegg 2U 



<21D> 65 
<211> 20 
<212> DNA 
<213> Homo 

<4 00> 65 
agtgtttagc 



<21D> 66 
<21L> 20 

<213> Homo 



sapiens 



<400> 66 

tt L.ggagg^l. 1-Attgr.gg^A 



20 
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<2U» 6? 

<2ll> 20 

<212> DNA 

<213> R<ntu> sapiens 

<dOO> 67 



<210> 68 

<2ll> 20 

<%V*> DNA 

<213> Komo sap iorv; 

<4 00> 

gaaagcactt tcgtttgcac 
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